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Field of the Invention 

[0001] This invention relates to a system, method, and program for identifyinjg a 
corresponding translation, more specifically, a translation identifying system that identifies at 
least a translation of a word to be translated in an original sentence written in a source 
language, a method for identifying a translation applicable to the translation-identifying 
system, and a program for allowing a computer to function as the aforementioned translation- 
identifying system. 
Description of related art 

[0002] Realization of a machine translation system that translates a sentence (an original 
sentence) described in a certain natural language (a source language) into a sentence 
(translated sentence) described in another language (a target language) hcis been expected for 
a long time, and various improved technologies relating to the machine translation have been 
proposed. 

[0003] For example. Patent Document 1 discloses a technology wherein a set of expression 
forms represented by a phrase is stored in an English- Japanese phrase dictionary installed on 
an HD device, an expression form composed of words coupled by a coordinating conjunction 
is retrieved in English text in a syntax analysis process, and when the retrieved expression 
forms are stored in the English- Japanese phrase dictionary, or when a prefix or a suffix of a 
word composing the retrieved expression form is identical, the retrieved expression form is 
recognized as a morpheme to analyze a sentence structure without separating the words. 
[0004] Moreover, Patent DocTxment 2 discloses a technology wherein a characteristic table 
correlating meanings and characteristics of each multivocal word is stored in advance, a 
characteristic record of the entered original sentence written in the first language is produced, 
and a meaning of a multivocal word is selected and displayed based on a characteristic 
possessed by the original sentence by comparing the produced characteristic table with the 
stored characteristic table. 

Patent Document 1 : Publication ntunber 11-328178 
Patent Document 2 : Publication number 6-314294 
Summary of the invention 
Problem to be solved by the invention 

[0005] The work of translating an original sentence described in a source language into a 
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translated sentence described in a target language is done through a translation process 
wherein words in the original sentence described in the source language are converted to 
words of the target language, and the converted words are rearranged according to a 
grammatical rule of the target language based on the recognition that a sentence is usually 
composed of words and a granmiar. 

This translation process is adopted by the machine translation that performs translation using 
a computer, where the system that corresponding translations are registered in a dictionary by 
word unit, words are extracted from the original sentence in sequence, corresponding 
translation of the extracted words are repetitively retrieved to replace the original sentence 
with a corresponding translation by word unit, a sentence structure is analyzed by identifying 
a part of speech of each word, and the corresponding translations by word imit are rearranged 
according to the grammatical rules of the target language based on the results of an analysis 
of the sentence structure to obtain translated sentences (corresponding translations) is 
generally used. 

[0006] In machine translation, even if a stereotyped phrase exists in the original sentence, 
there is a possibility that a suitable translation of the abovementioned phrase is obtained by 
applying the technology described in Patent Document 1, and even if a multivocal word 
exists in the original sentence, there is a possibility that a specific translation of the 
abovementioned multivocal word is obtained by applying the technology described in Patent 
Document 2. 

(Where the technology described in Patent Document 2 requires complicated work such that 
a multivocal word is extracted and meanings and characteristics of the extracted multivocal 
word are analyzed to prepare a characteristic table in advance.) 

However, in the machine translation, since it often happens that the translated sentence is 
found to be unnatural as a sentence of the target language even if the obtained translation has 
no grammatical errors and translation by word imit has no errors, actual circumstances are 
that translation accuracy suitable for practical use cannot be obtained by the existing machine 
translation technologies even if the technologies described in, for example. Patent Documents 
1 and 2 are applied. 

[0007] The causes of these circumstances are estimated to be the fact that the translation 
processes adopted in the machine translation differs significantly from the process of creating 
a sentence when people speak or write a sentence in their native language. 
That is to say, when people speak or write a sentence in their native language, the process 
wherein they retrieve words and create sentences by applying grammatical rules to the words 
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is not used, but they actually create sentences by connecting suitable words, phrases, and . 
sentences which they retrieved (selected depending on a situation) from a huge number of 
chunks (mass of meanings: words, phrases, stereotyped expressions, collocations, sentence 
structures, and sentences) accumulated in an individual brain based on the relation to the 
contextual words before and afler the word concemed and the background knowledge 
associated with the phrase itself. 

[0008] Therefore, natural translations as sentences in the target language can be obtained by 
seeking corresponding translations of the original sentence after recognizing the contextual 
relations to the words before and after the word concemed, the background knowledge 
associated with the phr2ises themselves, arid the units (that is to say, chunks) that people 
retrieve when they speak or write in their native language in the same way as when people 
speak or write in their native language. However, actual circumstances were that creation of 
translated sentences by seeking corresponding translations by chxmks was not yet realized in 
the machine translation system because it is not easy to identify the context or the 
background knowledge associated with phrases themselves, and punctuation of chunks is 
obscure. 

[0009] This invention was established taking the abovementioned facts into consideration, 
and has a purpose to obtain corresponding translation-identifying systems, corresponding 
translation identifjdng methods, and programs. 
Measures to work out the assignment 

[0010] In order to achieve the abovementioned purpose, the translation-identifying system 
relating to the invention according to a first embodiment comprises a storage means for 
storing a plurality of natural sentences composed of a plurality of words in the source 
language by correlating the natural sentences with the translations in the target language, a 
retrieval means for retrieving natural sentences containing phrases to be translated in original 
sentences in a source language from a plurality of natxiral sentences in said source language 
stored in said storage means, search means for searching natural sentences extracted by 
retrieval of said retrieval means for said phrase to be translated and frequently appearing 
translations appearing in the same sentence of said source language, and a translation 
identifying means for searching translated sentences of natural sentences containing each of 
said phrase to be translated and specific frequently appearing phrases for frequently 
appearing translations of phrase to be translated by referring to said specific frequently 
appearing phrases found by said search means and existing in said original sentences and 
referring to translated sentences of natural sentences containing each of said phrase to be 



4 



translated among natural sentences extracted by retrieval of said retrieval means and for 
identifying found frequently appearing translations as translations of said phrases to be 
translated in said original sentences. 

[0011] In the invention according to Claim 1, natural sentences composed of a plurality of 
words in the source language are stored in the storage means by correlating the natural 
sentences with the translations in the target language. 

The natural sentence relating to this invention is at least one of a sentence, a clause, a phrase, 
a stereotyped expression, and a collocation that are not subjected to editing or processing, 
such as division by word unit and extraction of multivocal words as in a dictionary in a 
conventional machine translation system, eliminates the necessity of providing a multivocal 
word with many corresponding translations as in the case of registration in a dictionary by 
word unit by storing these natural sentences by correlating these sentences with the 
translations in the target language (when a multivocal word is correlated with many 
translations, there is a possibility of erroneous selection because a suitable translation must be 
chosen) and can transform the translation corresponding to a natural sentence to natural 
sentences in the target language. 

Moreover, words and their corresponding translations in addition to the natural sentences may 
be stored in the storage means in this invention. 

[0012] In the invention according to Claim 1, a natural sentence that contains phrases to be 
translated in the original sentence described in the source language (any one of a sentence, a 
clause, a phrase, a stereotyped expression and a collocation) is retrieved by the retrieval 
means from a plurality of natural sentences in the source language stored in the storage 
means. 

The phrase to be translated that means the phrase especially to be translated in the original 
sentence may be a word or may be composed of a plurality of words. 
For example, when the translation-identifying system relating to this invention is used by a 
person (translator) as an electronic dictionary for translation work, the phrase to be translated 
is specified by the translator. 

Moreover, for example, when the translation-identifjdng system relating to this invention is 
used as a part of a machine translation system or an automatic interpretation system, the 
phrase to be translated is specified by the machine translation system or the automatic 
interpretation system that performs machine translation using the corresponding translation 
identified by the translation-identifying system relating to this invention. 
In addition, the original sentence containing the phrase to be translated may be also specified 
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by a user (translator, etc.) or may be specified automatically (for example, a sentence or a 
clause containing the phrase to be translated is automatically identified as the original 
sentence). 

[0013] In the invention according to the first embodiment, a search means searches the 
natural sentences extracted by the retrieval of the retrieval means for the phrases to be 
translated and the fi-equently appearing phrases that fi-equently appear in the same sentences 
of the source language, a translation identifying means identifies translations of at least said 
phrase to be translated in translated sentences of natural sentences containing each of 
altemative phrases identified by said identifying means and said phrase to be translated 
among natural sentences extracted by retrieval of said retrieval means, as translations of at 
least said phrase to be translated in said original sentences. 

[0014] When a phrase to be translated and specific fi-equently appearing phrases that 
frequently appear (called correlation) in the same sentence in the source language exist in the 
original sentence, there is a high probability that the translation of the phrase to be translated 
coincides with the translation of the phrase to be translated in the corresponding translation of 
a natural sentence containing each of the phrases to be translated and specific frequently 
appearing phrases. 

However, although there is a high probability that the phrases to be translated are identical to 
a suitable translation of the phrases to be translated in the corresponding translation of a 
natural sentence containing each of the specific frequently appearing phrases, there also is a 
possibiUty that natural sentences having unsuitable translations of phrases to be translated are 
mixed in the abovementioned natural sentences stored in the storage means. 
Based on this, in the invention according to the first embodiment, since frequently appearing 
phrases having a high correlation with phrases to be translated are searched for, frequently 
appearing translations for phrases to be translated and phrases to be translated in the 
translation of a natural sentence containing each of the specific frequently appearing phrases 
are found by referring to the specific frequently appearing phrases existing in the original 
sentence among the recognized frequently appearing phrases and the translation of a natural 
sentence containing each of the phrases to be translated. Therefore, suitable translations of 
phrases to be translated in the original sentence (corresponding translation where there is a 
high probability that natural translations are obtained as sentences in the target language from 
the original sentence) can be obtained based on the frequently appearing phrases even if it is 
difficult to identify (select) a suitable natural sentence based only on the degree of 
coincidence. 
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[0015] It is difficult to identify the suitable translation taking the contexts before and after a 
word into consideration because there are many options depending on the situation. In the 
invention according to the first embodiment, the corresponding translation of a phrase to be 
translated is identified by focusing on the phrase that frequently appears in the same text at 
the same time and by referring to the corresponding translation sentence of a natural sentence 
where this phrase and the phrase to be translated appear at the same time. Therefore, it is not 
necessary to take the contexts before and after the original sentence into consideration. As a 
result, a suitable translation of a multivocal word can be obtained roughly considering the 
contexts before and after the original sentence. 

It is possible to search for the abovementioned frequently appearing phrases by registering 
phrases that frequently appear in the same text of the source language in a table and referring 
to the table. In the invention according to the first embodiment, since the frequently 
appearing phrases are found based on the natural sentences extracted by the retrieval of the 
retrieval means, there are two advantages wherein time and labor for preparing the 
abovementioned table can be saved, and the storage capacity required for storing the table 
can also be saved. 

[0016] The translation identifying system relating to the invention according to a second 
embodiment comprises a storage means for storing a plurality of natural sentences, of original 
sentences composed of a plurality of words, correlated with translated sentences in a target 
language, a retrieval means for retrieving natural sentences containing phrase to be translated 
in original sentences in a source language from a plurality of natural sentences in said source 
language stored in said storage means, an identifying means for identifying alternative 
phrases existing in said original sentences and replaceable with targeted phrases not 
contained in natural sentences extracted by retrieval of said retrieval means, an identifying 
means for identifying alternative phrases existing in said original sentences and replaceable 
with targeted phrases not contained in natural sentences extracted by retrieval of said retrieval 
means, a translation identifying means for identifying translations of at least said phrase to be 
translated in translated sentences of natural sentences containing each of alternative phrases 
identified by said identifying means and said phrase to be translated among natural sentences 
extracted by retrieval of said retrieval means, as translations of at least said phrase to be 
translated in said original sentences. 

[0017] In common with the invention as in the first embodiment, multiple natural 
sentences in the source language, respectively composed of multiple words, are stored in 
memory in the invention embodied in the second embodiment, corresponding to the parallel 
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version in a target language out of which natural sentences containing the targeted phrases in 
the original text in a source language are retrieved by the search tool. 
A natural sentence where a specific phrase in the original sentence is replaced with another 
phrase is sometimes extracted by the retrieval of the retrieval means. In this case, if meanings 
of the original sentence and the natural sentence are similar, the specific phrase is considered 
to have a replaceable relation to the other phrase. 

Even if a pluraUty of natural sentences is extracted by the retrieval of the retrieval means, 
when a natural sentence where a specific phrase for the original sentence is replaced with the 
other phrase (alternative phrase) having a replaceable relation to the specific phrase exist in 
these natural sentences, it is preferable to select this natural sentence because there is a high 
probability that this natural sentence and the original sentence have the same meaning. 
[0018] Based on this, in the invention according to the second embodiment, an altemative 
phrase that is replaceable with a targeted phrase extracted by the retrieval of the retrieval 
means and not contained in the natural sentence existing in the original sentence is identified 
by the identifying means, and the translation identifying means identifies the translation of 
the altemative phrase identified by the identifying means and of at least the phrase to be 
translated in the corresponding translation sentence of a natural sentence containing each of 
the phrases to be translated among the natural sentences extracted by the retrieval of the 
retrieval means as the translation of at least the phrase to be translated in the original sentence. 
Thus, based on an altemative phrase, suitable translations of phrases to be translated in 
original sentences having a high probability of obtaining natural translated sentences as 
sentences in a target language fi-om the original sentences in a source language can be 
obtained. 

[001 9] Identification of an altemative phrase by the identifying means relating to the 
invention according to the second embodiment may be performed by registering phrases 
having replaceable relations with each other in a table and referring to this table. For example 
the identification may be performed by retrieving a natural sentence containing a targeted 
phrase firom a plurality of natural sentences stored in the storage means, retrieving a natural 
sentence having a sentence stmcture identical to the natural sentence extracted by this 
retrieval from the natural sentences stored in the storage means, and identifying the phrase 
replaced with the targeted phrase in the natural sentence extracted by this retrieval as the 
altemative phrase as embodied in the third embodiment. 

Also, in this case, time and labor for preparing the abovementioned table can be saved, and 
the storage capacity required for storing the table can be also saved. 



8 



[0020] Since an immeasurable amoimt of background knowledge associated with phrases . 
themselves having various properties depending on these phrases exist, it is difficult to 
extract all of them to classify into categories. 

In the inventions according to the second and third embodiments, since a phrase replaceable 
in the same sentence structure is identified as a phrase in the same category, it is possible to 
obtain suitable translations roughly based on the backgroimd knowledge associated with 
phrases themselves without obtaining background knowledge of entire phrases to classify 
them into categories. 

[0021] Li the invention according to the first or second embodiment, as embodied in, for 
example, a fourth embodiment, it is preferable that a translation identifying means evaluates a 
degree of coincidence between natural sentences extracted by retrieval of said retrieval means 
and said original sentence and identifies translations of at least said phrase to be translated in 
translated sentences of natural sentences selected on the basis of said evaluated degree of 
coincidence, as translations of at least said phrase to be translated in said original sentence. 
Thus a natural sentence similar to the original sentence among natural sentences extracted by 
the retrieval of the retrieval means (for example, a natural sentence used with the same 
meaning as the multivocal word existing in the original sentence) is selected with high 
probability as a natural sentence with a high degree of coincidence, a natural translation 
corresponding to the selected natural sentence is obtained, and the translation of, at least, the 
phase to be translated in the translations are identified as the translation of the phrase to be 
translated in the original sentence (it is a matter of course that the translation of a phrase other 
than the phrase to be translated in the corresponding translation can be identified as the 
translation of the phrase in the original sentence depending on the degree of coincidence with 
the original sentence). 

[0022] Thus the invention according to the fourth embodiment, considering the great 
technical difficulty of identifjdng punctuation of chunks in the original sentence, stores a 
plurality of natural sentences in the source language that correspond to the translations in the 
target language, selects a natural sentence having a high degree of coincidence with the 
original sentence in the source language from the stored natural sentences, and identifies the 
translation in the natural translations corresponding to the selected natural sentences as the 
translation of, at least, the phrase to be translated in the original sentence. Therefore the 
invention enables the obtaining of natural translations equal to the translation corresponding 
to the original sentence performed generally by chimk as a result that it is not necessary to 
identify the punctuation of the chunk in the original sentence and enables the obtaining of 
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suitable translations of the phrases to be translated in the original sentence (the translation 
having a high probability of obtaining from the original sentence a natural translation as a 
sentence in the target language). 

Moreover, since the invention according to the fourth embodiment determines a degree of 
coincidence between a natural sentence extracted by the retrieval of the retrieval means and 
the original sentence, and can obtain the suitable translation by selecting a natural sentence 
based on the determined degree of coincidence, it is not necessary to perform complicated 
processing, such as identification of a part of speech and an analysis of sentence structure as 
in the conventional machine translation system, and can simplify the processing. 
[0023] The parallel translation identifying system related to the invention as defined in a fifth 
embodiment consists of the following tools: the memorizing means storing multiple natural 
sentences composed of multiple words in a source language coordinating with the parallel 
sentences in a target language; the retrieval tool searching natural sentences containing the 
object phrases to be rendered in the original text in the source language out of the multiple 
natural sentences in the source language retained in the above memory; and the parallel 
translation identifying system processing the minimal translation target phrase in the natural 
parallel version selected based on the total value of the first evaluated value computed as the 
minimal object phrase in the original text by respectively adding the first evaluated value 
corresponding to the small distance between the phrase to be rendered and the respective 
words in the original text, identifying the words matching the original from among the 
respective words in the natural sentence in question about the natural one extracted through 
the retrieval tool, and by computing by the natural sentence for which the total value of the 
said first evaluated value added to the corresponding words were recognized. 
[0024] The invention as embodied in the fifth embodiment is equipped with the same 
memory and search means as embodied in the first and second embodiments with the first 
evaluated value individually assigned corresponding to the small distance between the 
respective words and the phrase to be rendered in the original text, recognizing the matching 
words with the original from among the respective words in the relevant natural sentences 
extracted by the retrieval imit, computing by the natural sentence the total value "of the first 
value given to the matching words recognized, based on which identifying the parallel 
version of the minimal object phrases in the selected natural sentence as the corresponding 
translation of the minimal target phrases in the original text. Therefore, on the basis of the 
total value of the first evaluated value, being capable of selecting the natural sentence 
containing more words located in the proximity of the target phrases to be rendered in the 
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original text, i.e., a natxiral sentence with the high possibihty of containing a collocation of 
phrases composed of the target phrases and words in the vicinity to be rendered in the 
original text (a natural sentence assumed with higher degree of similarity in the alignment 
sequence of the words compared with the original) can be chosen, thus, acquiring an 
appropriate parallel translation of the phrases in the original to be rendered with the high 
possibihty of producing a natural rendition as a sentence in the target language from the 
original text in the source language (parallel rendition with higher possibility of natural 
translation from the original text into the target language). 

[0025] In addition, in the invention as embodied in the fifth embodiment, it will be 
preferable to configure the parallel translation identifying means in such a way as embodied 
in a sixth embodiment, if there are multiple sentences from the computed result of the first 
evaluated value, about the respective natural sentences of such multiplicity, that the parallel 
translation of the minimal phrases to be rendered in the parallel translation of the natural 
sentence selected based on the computed total value of the second evaluated value can be 
identified as the parallel translation for the minimal phrases in the original text, respectively 
assigning the second evaluated value according to the small distance in the above natural 
sentence from the object phrases for rendering in the natural sentence for the matching words 
in the said sentence, computing the total value of the second evaluated value given to the 
corresponding words aforementioned by the natural sentence. 

Thus, if there are multiple sentences from the computed result of the first evaluated value, on 
the basis of the total value of the second evaluated value, the natural sentence containing a 
larger nxmiber of the words coinciding with the original sentence and containing the 
mismatched words existing between the coinciding words as small as possible, i.e., a natural 
sentence with the high possibility of containing a collocation of phrases composed of the 
target phrases and words in the vicinity to be rendered in the original text (a natural sentence 
assumed with higher degree of similarity in the alignment sequence of the words compared 
with the original) can be chosen, thus, acquiring an appropriate parallel translation of the 
phrases in the original to be rendered with the high possibility of producing a natural 
rendition as a sentence in the target language from the original text in the source language 
(parallel rendition with higher possibility of natural translation from the original text into the 
target language). 

[0026] The parallel translation identifying system related to the invention as embodied in 
the seventh embodiment consists of the following means and unit: the memorizing means 
storing multiple sentences in the source language composed of multiple words coordinating 
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the rendered parallel sentences of the target language; the search unit retrieving the natural 
sentence containing the target phrases to be rendered in the original text of the source 
language from the multiple natural sentences in the soxirce language stored in the above 
memory; and the parallel translation identifying system recognizing the rendition of the said 
minimal parallel phrases in the parallel natural sentences selected based on the total value of 
the third evaluated value computed as the parallel translation of the minimal parallel phrases 
in the original text in question, searching, to the natural sentence extracted by the search unit, 
from among the anterior word groups aligned toward the front side from the object phrases to 
be rendered in the original text, whether the specific anterior words non-retrieved and with 
minimum distance from the object phrases to be rendered are located toward the front of the 
object phrases to be rendered in the natural sentence or the anterior words extracted in the 
preceding search within the prescribed nimiber of words, repeatedly assigning the third 
evaluated value according to the minimal distance between the specific anterior words upon 
detection and the object phrases to be rendered or the anterior words extracted in the 
preceding search xmtil the non-retrieved anterior words have ceased to exist in the 
aforementioned original text, and concomitantly searching, to the natural sentence extracted 
by the search unit, from among the posterior word groups aligned toward the back from the 
object phrases to be rendered in the original text, whether the specific posterior words non- 
retrieved and with minimum distance from the object phrases to be rendered are located 
toward the back of the object phrases to be rendered in the natural sentence or the posterior 
words extracted in the preceding search within the prescribed number of words, repeatedly 
assigning the third evaluated value according to the minimal distance between the specific 
posterior words upon detection and the object phrases to be rendered or the posterior words 
extracted in the preceding search until the non-retrieved posterior words have ceased to exist 
in the aforementioned original text, thus, computing the total value of the third evaluated 
value by the natural sentence. 

[0027] The invention as embodied in the seventh embodiment is equipped with the same 
memorizing means and the search unit as embodied in the first and second embodiments, 
with the parallel translation identifying system, searching, to the natural sentence extracted by 
the search unit, from among the anterior word groups aligned toward the front side from the 
object phrases to be rendered in the original text, whether the specific anterior words non- 
retrieved and with minimum distance from the object phrases to be rendered are located 
toward the front of the object phrases to be rendered in the natural sentence or the anterior 
words extracted in the preceding search within the prescribed number of words, repeatedly 
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assigning the third evaluated value according to the minimal distance between the specific 
anterior words upon detection and the object phrases to be rendered or the anterior words 
extracted in the preceding search until the non-retrieved anterior words have ceased to exist 
in the aforementioned original text, and concomitantly searching, to the natural sentence 
extracted by the search unit, firom among the posterior word groups aligned toward the back 
firom the object phrases to be rendered in the original text, whether the specific posterior 
words non-retrieved and with minimimi distance firom the object phrases to be rendered are 
located toward the back of the object phrases to be rendered in the natural sentence or the 
posterior words extracted in the preceding search within the prescribed number of words, 
repeatedly assigning the third evaluated value according to the minimal distance between the 
specific posterior words upon detection and the object phrases to be rendered or the posterior 
words extracted in the preceding search until the non-retrieved posterior words have ceased 
to exist in the aforementioned original text, thus, computing the total value of the third 
evaluated value by the natural sentence. Since the object phrases to be rendered in the object 
sentence of the natural sentence are at least identified based on the total value of the third 
evaluated value as the object phrases to be rendered in the original text at least, the similarity 
in the word alignment sequence is quite high compared to the original text on the basis of the 
total value of the third evaluated value, and concomitantly, a natural sentence with the least 
number of discordant words, i.e., the natural sentence with the original text excluding 
discordant words as much as possible (status of tighter group of words) can thus be chosen 
and created with a higher possibility of obtaining a natural parallel version with more 
appropriate phrases as a sentence of the target language rendered fi*om the original text in the 
source language. 

[0028] The parallel tremslation identifying system related to the invention as embodied in en 
eighth embodiment consists of the following means and unit: the memorizing means storing 
multiple sentences in the source language composed of multiple words coordinating the 
rendered parallel sentences of the target language; the search unit retrieving the natural 
sentence containing the target phrases to be rendered in the original text of the source 
language fi-om the multiple natural sentences in the source language stored in the above 
memory; and the parallel translation identifying system recognizing at least the parallel 
version of the object phrases to be rendered in the parallel natural sentence selected based on 
the total value computed fi-om the number of words by the first distance regarding the front 
side and from the nxmiber of words by the first distance regarding the back side as at least the 
parallel version for the object phrases to be rendered in the original text, searching, regarding 
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the natural sentence extracted through the process by the search unit, to see whether the 
specific anterior words non-retrieved and with minimal distance from the object phrases to be 
rendered are aligned on the fi^ont side of the object phrases to be rendered in the natural 
sentence or of the anterior words extracted in the preceding search process within the 
prescribed nimiber of words fi-om among the first anterior group of words located on the firont 
side fi-om the object phrases to be rendered in the original text, repeating the process of 
counting up the number of words by the first distance regarding the fi-ont side corresponding 
to the distance firom the object phrases to be rendered when the specific anterior words are 
detected or the fi'om anterior words extracted in the preceding retrieval process till the non- 
retrieved anterior words have ceased to exist in the first anterior group of words, and 
concomitantly, searching to see whether the specific posterior words non-retrieved and with 
minimal distance fi'om the object phrases to be rendered are aligned on the back side of the 
object phrases to be rendered in the natural sentence or of the posterior words extracted in the 
preceding search process within the prescribed number of words firom among the first 
posterior group of words located on the back side firom the object phrases to be rendered in 
the original text, repeating the process of counting up the number of words by the first 
distance regarding the back side corresponding to the distance fi-om the object phrases to be 
rendered when the specific posterior words are detected or the fi-om posterior words extracted 
in the preceding retrieval process till the non-retrieved posterior words have ceased to exist in 
the first posterior group of words. 

[0029] The invention as embodied in the eighth embodiment is equipped with the same 
memorizing means and the search unit as in the first and second embodiments, with the 
parallel translation identifying system recognizing at least the parallel version of the object 
phrases to be rendered in the parallel natural sentence selected based on the total value 
computed from the number of words by the first distance regarding the front side and from 
the number of words by the first distance regarding the back side as at least the parallel 
version for the object phrases to be rendered in the original text, searching, regarding the 
natural sentence extracted through the process by the search unit, to see whether the specific 
anterior words non-retrieved and with minimal distance from the object phrases to be 
rendered are aligned on the front side of the object phrases to be rendered in the natural 
sentence or of the anterior words extracted in the preceding search process within the 
prescribed number of words from among the first anterior group of words located on the front 
side from the object phrases to be rendered in the original text, repeating the process of 
counting up the number of words by the first distance regarding the front side corresponding 
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to the distance from the object phrases to be rendered when the specific anterior words are ■ 
detected or the from anterior words extracted in the preceding retrieval process till the non- 
retrieved anterior words have ceased to exist in the first anterior group of words, and 
concomitantly, searching to see whether the specific posterior words non-retrieved and with 
minimal distance from the object phrases to be rendered are aligned on the back side of the 
object phrases to be rendered in the natural sentence or of the posterior words extracted in the 
preceding search process within the prescribed nimiber of words from among the first 
posterior group of words located on the back side from the object phrases to be rendered in 
the original text, repeating the process of counting up the number of words by the first 
distance regarding the back side corresponding to the distance from the object phrases to be 
rendered when the specific posterior words are detected or the from posterior words extracted 
in the preceding retrieval process till the non-retrieved posterior words have ceased to exist in 
the first posterior group of words. Thus, the similarity in the word alignment sequence is 
quite high compared to the original text on the basis of the total value computed from the 
number of words by the first distance regarding the front side and from the number of words 
by the first distance regarding the back side, and concomitantly, a natural sentence with the 
least number of discordant words, i.e., the natural sentence with the original text excluding 
discordant words as much as possible (status of tighter group of words) can be chosen and 
created with a higher possibility of obtaining a natural parallel version with more appropriate 
phrases as a sentence of the target language rendered from the original text in the source 
language. 

[0030] Moreover, the parallel translation identifying system in the invention as embodied in 
the eighth embodiment as in [[9]] a ninth embodiment, for example, is located on the front 
compared to the object phrases to be rendered in the original text toward the natural sentence 
extracted by the search unit, exploring whether the specific front words non-retrieved and 
with minimal distance from the object phrases to be rendered among the second anterior 
group of words excluding the anterior words adjacent to the object phrases to be rendered are 
aligned on the front of the object phrases to be rendered in the natural sentence or anterior 
group of words extracted in the preceding search and within the prescribed number of words, 
repeatedly counting up 1 to the distance between the specific front words when discovered 
and the object phrases to be rendered or the front words extracted in the preceding search or 
adding the nimiber of words by the second interval in relation to the corresponding front side 
to the distance added to the interval computed regarding the front-side words discovered in 
the preceding search until the non-retrieved front words have ceased to exist in the original 
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text, concomitantly selecting as the final recognition the result of a shorter distance betwieen 
the respective words firom among the number of words by the first and second intervals 
regarding the fi-ont side respectively, and also, toward the natural sentence extracted by the 
search unit, exploring whether the specific back-side words non-retrieved and with minimal 
distance fi-om the object phrases to be rendered among the second posterior group of words 
excluding the posterior words adjacent to the object phrases to be rendered are aligned on the 
back of the object phrases to be rendered in the natural sentence or posterior group of words 
extracted in the preceding search and within the prescribed number of words, repeatedly 
counting up 1 to the distance between the specific back-side words when discovered and the 
object phrases to be rendered or the back-side words extracted in the preceding search or 
adding the number of words by the second interval in relation to the corresponding back side 
to the distance added to the interval computed regarding the back-side words discovered in 
the preceding search until the non-retrieved back-side words have ceased to exist in the 
original text, concomitantly selecting as the final recognition the result of a shorter distance 
between the respective words fi-om among the number of words by the first and second 
intervals regarding the back side respectively, performing a comprehensive evaluation by 
adding up the final recognition on the anterior and posterior sides, thus, preferably 
configuring and identifying at least the parallel version of the object phrases to be rendered in 
the parallel sentence of the natural sentence of the result with shorter intra-word distance in 
the computed integrated recognition as the parallel translation at least of the target phrases to 
be rendered in the original text. 

As shown in the above, by using the number of words by the second distance in combination 
with those by the first distance, the degree of similarity in the alignment sequence of the 
words compared to the original text can be justly evaluated by the number of words by the 
second distance also regarding the natural sentence the sequence of the part of which the 
words have been changed toward the original text, thus, improving the probability of 
obtaining more a appropriate parallel version of the object phrases to be rendered in the 
original text. 

[0031] The translation identifying method relating to the invention according to a tenth 
embodiment comprises a first step for retrieving natural sentences containing phrases to be 
translated in the original sentence in the source language fi-om natural sentences in the source 
language composed of a plurality of words stored in the storage means by correlating the 
natural sentences with the translations of the target language, a second step for searching the 
natural sentences extracted by the retrieval of the retrieval means in the aforementioned first 
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step for the aforementioned phrases to be translated and the frequently appearing phrases that 
frequently appear in the same sentences of the source language, and a third step for searching 
for the aforementioned phrases to be translated and the frequently appearing translations for 
the phrases to be translated in the corresponding translation sentences of the natural sentences 
containing each of the aforementioned specific frequently appearing phrases and identifying 
the found frequently appearing translations as the translations of the aforementioned phrases 
to be translated in the original sentence by referring to the specific frequently appearing 
phrases found in the aforementioned second step and existing in the aforementioned original 
sentence among the natural sentences extracted by the retrieval in the aforementioned first 
step and referring to the corresponding translation sentences of the aforementioned natural 
sentences each of which contains the aforementioned phrases to be translated. Therefore in 
the same manner as the invention according to the first embodiment, the suitable translations 
of the phrases to be translated in the original sentence (the translation having high probability 
of obtaining from the original sentence a natural translation as a sentence in the target 
language) can be obtained. 

[0032] The translation identifying method relating to the invention according to an eleventh 
embodiment comprises a first step for retrieving natural sentences containing phrases to be 
translated in the original sentence in the source language from natural sentences in the source 
language composed of a plurality of words stored in the storage means by correlating the 
natural sentences with the translations of the target language, a second step for identifying 
alternative phrases existing in said original sentences and replaceable with targeted phrases 
not contained in natural sentences extracted by retrieval of said first step, and a third step for 
identifying translations of at least said phrase to be translated in translated sentences of 
natural sentences containing each of altemative phrases identified by said second step and 
said phrase to be translated among natural sentences extracted by retrieval of said first step, 
as translations of at least said phrase to be translated in said original sentences. Thus, in the 
same manner as the invention according to the second embodiment, suitable translations of 
phrases to be translated in the original sentence (the translation having high probability of 
obtaining from the original sentence a natural translation as a sentence in the target language) 
can be obtained. 

[0033] The parallel translation identifying system related to the invention as defined in a 
twelfth embodiment consists of two steps: the first step to search the natural sentence 
composed of object phrases to be rendered in the original text in a source language from out 
of the natural sentences in a source language composed of multiple words respectively stored 
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in multiples in memory in response to the parallel version of the target language; and the 
second step to identify the parallel version of the object phrases to be rendered at least in the 
parallel sentence of the natural sentence selected based on the total value of the first 
evaluated value computed at least as the parallel version of the object phrases to be rendered 
in the original text, respectively granting the first evaluated value corresponding to the small 
distance in the original text from the object phrases to be rendered in the original text toward 
the respective words in the original text, recognizing the words concordant with the original 
text from among the respective words in the relevant natural sentence in relation to the 
natural sentence extracted by the search process in the first step, thus, computing by the 
natural sentence the total value of the first evaluated value granted to the corresponding 
words identified. Therefore, in the same way as the invention as defined in a fifth 
embodiment, an appropriate parallel translation can be obtained for the object phrases to be 
rendered in the original text (i.e., parallel version with a higher possibility of acquiring a 
natural sentence as a sentence in the target language from the original text). 
[0034] The parallel translation identifying system related to the invention as defined in a 
thirteenth embidoment consists of two steps: the first step to search the natural sentence 
composed of object phrases to be rendered in the original text in a source language from out 
of the natural sentences in a source language composed of multiple words respectively stored 
in multiples in memory in response to the parallel version of the target language; and the 
second step to recognize the rendition of the said minimal parallel phrases in the parallel 
natural sentences selected based on the total value of the third evaluated value computed as 
the parallel translation of the minimal parallel phrases in the original text in question, 
searching, to the natural sentence extracted by the search process in the first step, from 
among the anterior word groups aligned toward the front side from the object phrases to be 
rendered in the original text, whether the specific anterior words non-retrieved and with 
minimum distance from the object phrases to be rendered are located toward the front of the 
object phrases to be rendered in the natural sentence or the anterior words extracted in the 
preceding search within the prescribed number of words, repeatedly assigning the third 
evaluated value according to the minimal distance between the specific anterior words upon- 
detection and the object phrases to be rendered or the anterior words extracted in the 
preceding search until the non-retrieved anterior words have ceased to exist in the 
aforementioned original text, and concomitantly searching, to the natural sentence extracted 
by the search process in the first step, from among the posterior word groups aligned toward 
the back from the object phrases to be rendered in the original text, whether the specific 
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posterior words non-retrieved and with minimum distance from the object phrases to be 
rendered are located toward the back of the object phrases to be rendered in the natural 
sentence or the posterior words extracted in the preceding search within the prescribed 
number of words, repeatedly assigning the third evaluated value according to the minimal 
distance between the specific posterior words upon detection and the object phrases to be 
rendered or the posterior words extracted in the preceding search until the non-retrieved 
posterior words have ceased to exist in the aforementioned original text, thus, computing the 
total value of the third evaluated value by the natural sentence. Therefore, in the same way as 
the invention as defined in claim 7, an appropriate parallel translation can be obtained for the 
object phrases to be rendered in the original text (i.e., parallel version with a higher 
possibility of acquiring a natural sentence as a sentence in the target language from the 
original text). 

[0035] The parallel translation identifying system related to the invention as defined in a 
fourteenth embodiment consists of two steps: the first step to search the natural sentence 
composed of object phrases to be rendered in the original text in a source language from out 
of the natural sentences in a source language composed of multiple words respectively stored 
in multiples in memory in response to the parallel version of the target language; and the 
second step to recognize at least the parallel version of the object phrases to be rendered in 
the parallel natural sentence selected based on the total value computed from the number of 
words by the first distance regarding the front side and from the number of words by the first 
distance regarding the back side as at least the parallel version for the object phrases in the 
original text, searching, regarding the natural sentence extracted by the search process in the 
first step, to see whether the specific anterior words non-retrieved and with minimal distance 
from the object phrases to be rendered are aligned on the front side of the object phrases to be 
rendered in the natural sentence or of the anterior words extracted in the preceding search 
process within the prescribed number of words from among the first anterior group of words 
located on the front side from the object phrases to be rendered in the original text, repeating 
the process of counting up the number of words by the first distance regarding the front side 
concordant with the interval from the object phrases to be rendered upon detection of the 
specific anterior words and from the anterior words extracted in the preceding search process 
till the non-retrieved anterior words have ceased to exist in the first anterior group of words, 
and concomitantly, searching, regarding the natural sentence extracted through the process by 
the search process in the first step, to see whether the specific posterior words non-retrieved 
and with minimal distance from the object phrases to be rendered are aligned on the back side 
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of the object phrases to be rendered in the natural sentence or of the posterior words extracted 
in the preceding search process within the prescribed number of words from among the first 
posterior group of words located on the back side from the object phrases to be rendered in 
the original text, repeating the process of counting up the number of words by the first 
distance regarding the back side concordant with the interval from the object phrases to be 
rendered upon detection of the specific posterior words and from the posterior words 
extracted in the preceding search process till the non-retrieved posterior words have ceased to 
exist in the first posterior group of words. Therefore, in the same way as the invention as 
defined in claim 8, an appropriate parallel translation can be obtained for the object phrases to 
be rendered in the original text (i.e., parallel version with a higher possibility of acquiring a 
natural sentence as a sentence in the target language from the original text). 
[0036] The program relating to the invention according to a fifteenth embodiment allows a 
computer connected to a storage means for storing a plurality of natural sentences in the 
source language composed of a plurality of words having corresponding translations in the 
target language to frmction as a retrieval means for retrieving the natural sentences containing 
the phrases to be translated in the original sentence in the source language from a plurality of 
natural sentences in the source language stored in the aforementioned storage means, a search 
means for searching the natural sentences extracted by the retrieval by the aforementioned 
retrieval means for the aforementioned phrases to be translated and the frequently appearing 
phrases that frequently appear in the same sentences of the source language, and a second 
translation identifying means for searching for the aforementioned phrases to be translated 
and the frequently appearing translations for the phrases to be translated in the corresponding 
translation sentences of the natural sentences each of which contains the aforementioned 
specific frequently appearing phrases and for identifying the foimd frequently appearing 
translations as the translations of the aforementioned phrases to be translated in the 
aforementioned original sentence by referring to the specific frequently appearing phrases 
found by the aforementioned search means and existing in the aforementioned original 
sentence among the natural sentences extracted by the retrieval by the aforementioned 
retrieval means and referring to the corresponding translation sentences of the natural 
sentences each of which contains the aforementioned phrases to be translated. 
The program relating to the invention according to the fifteenth embodiment is a program for 
allowing a computer connected to a storage means for storing a plurality of natural sentences 
in the source language composed of a plurality of words having corresponding translations in 
the target language (either a computer in which a storage means is built or a computer 
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connected to another computer connected to a storage means through telecommunication 
line) to function as the abovementioned retrieval means, search means, and translation 
identifying means. Therefore, when the abovementioned computer runs the program relating 
to the invention according to the fifteenth embodiment, the abovementioned computer 
functions as the translation identifying means according to the first embodiment. Therefore in 
the same manner as the invention according to the first embodiment, the suitable translations 
of the phrases to be translated in the original sentence (the translation having high probability 
of obtaining from the original sentence a natural translation as a sentence in the target 
language) can be obtained. 

[0036] The program relating to the invention according to the sixteenth embodiment allows 
a computer connected to a storage means for storing a plurality of natural sentences in the 
source language composed of a plurality of words having corresponding translations in the 
target language to function as a retrieval means for retrieving natural sentences containing 
phrase to be translated in original sentences in a source language firom a plurahty of natural 
sentences in said source language stored in said storage means, an identifying means for 
identifying altemative phrases existing in said original sentences and replaceable with 
targeted phrases not contained in natural sentences extracted by retrieval of said retrieval 
means, and a translation identifying means for identifying translations of at least said phrase 
to be translated in translated sentences of natural sentences containing each of altemative 
phrases identified by said identifying means and said phrase to be translated among natural 
sentences extracted by retrieval of said retrieval means, as translations of at least said phrase 
to be translated in said original sentences. 

The program relating to the invention according to the sixteenth embodiment is a program for 
allowing a computer connected to the abovementioned storage means to function as the 
abovementioned retrieval means, search means, and translation identifying means. Therefore, 
when the abovementioned computer runs the program relating to the invention according to 
the sixteenth embodiment, the computer functions as the translation identifying means 
according to the second embodiment. Therefore, in the same manner as the invention 
according to the second embodiment, the suitable translations of the phrases to be translated 
in the original sentence (the translation having high probability of obtaining from the original 
sentence a natural translation as a sentence in the target language) can be obtained. 
[0037] The program relating to the invention according to the seventeenth embodiment 
allows a computer connected to a storage means for storing a plurality of natural sentences in 
the source language composed of a plurality of words having corresponding translations in 
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the target language to function as a retrieval means for retrieving natural sentences containing 
phrase to be translated in original sentences in a source language from a plurality of natural 
sentences in said source language stored in said storage means, and a parallel translation 
identifying system for processing the minimal translation target phrase in the natiu*al parallel 
version selected based on the total value of the first evaluated value computed as the minimal 
object phrase in the original text by respectively adding the first evaluated value 
corresponding to the small distance between the phrase to be rendered and the respective 
words in the original text, identifying the words matching the original from among the 
respective words in the natural sentence in question about the natural one extracted through 
the retrieval tool, and by computing by the natural sentence for which the total value of the 
said first evaluated value added to the corresponding words were recognized. 
The program relating to the invention according to the seventeenth embodiment is a program 
for allowing a computer connected to the abovementioned storage means to fimction as the 
abovementioned retrieval means and translation identifying means. Therefore, when the 
abovementioned computer runs the program relating to the invention according to the 
seventeenth embodiment, the computer fimctions as the translation identifydng means 
according to the fifth embodiment. Therefore, in the same manner as the invention according 
to the fifth embodiment, the suitable translations of the phrases to be translated in the original 
sentence (the translation having high probability of obtaining from the original sentence a 
natural translation as a sentence in the target language) can be obtained. 
[0038] The program relating to the invention according to the eighteenth embodiment allows 
the parallel translation identifydng methodology to function recognizing at least the parallel 
version of the object phrases to be rendered in the parallel natural sentence selected based on 
the computed total value of the third evaluated value as at least the parallel version for the 
object phrases to be rendered in the original text, making the methodology work together 
with other fimctions: the memorizing means storing multiple natural sentences composed of 
multiple words in source language by matching the parallel sentences in a target language and 
the search unit, by means of the connected computer, retrieving a natural sentence containing 
the object phrases to be rendered in the original text in a source language from among 
multiple natural sentences in a source language stored in the memory, searching, regarding 
the natural sentence extracted through the process by the search unit, to see whether the 
specific anterior words non-retrieved and with minimal distance from the object phrases to be 
rendered are aligned on the front side of the object phrases to be rendered in the natural 
sentence or of the anterior words extracted in the preceding search process within the 
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prescribed number of words from among the anterior group of words located on the front side 
from the object phrases to be rendered in the original text, repeating the process of granting 
the third evaluated value concordant with the small interval between the detected specific 
anterior words if detected and the object phrases to be rendered or the anterior words 
extracted in the preceding retrieval process till the non-retrieved anterior words have ceased 
to exist in the original text, and concomitantly, searching to see whether the specific posterior 
words non-retrieved and with minimal distance from the object phrases to be rendered are 
ahgned on the back side of the object phrases to be rendered in the natural sentence or of the 
posterior words extracted in the preceding search process within the prescribed number of 
words from among the posterior group of words located on the back side from the object 
phrases to be rendered in the original text, repeating the process of granting the third 
evaluated value concordant with the small interval between the detected specific posterior 
words if detected and the object phrases to be rendered or the posterior words extracted in the 
preceding retrieval process till the non-retrieved posterior words have ceased to exist in the 
original text. 

The program relating to the invention according to the eighteenth embodiment is a program 
for allowing a computer connected to the abovementioned storage means to function as the 
abovementioned retrieval means and translation identifying means. Therefore, when the 
abovementioned computer runs the program relating to the invention according to the 
eighteenth embodiment, the computer fimctions as the translation identifying means 
according to the seventh embodiment. Therefore, in the same manner as the invention 
according to the seventh embodiment, the suitable translations of the phrases to be translated 
in the original sentence (the translation having high probability of obtaining from the original 
sentence a natural translation as a sentence in the target language) can be obtained. 
[0039] The program relating to the invention according to a nineteenth embodiment allows 
the parallel translation identifying methodology to function recognizing at least the parallel 
version of the object phrases to be rendered in the parallel natural sentence selected based on 
the nimiber of words by the first distance regarding the front side and from the number of 
words by the first distance regarding the back side as at least the parallel version for the 
object phrases to be rendered in the original text, making the methodology work together 
with other functions: the memorizing means storing multiple natural sentences composed of 
multiple words in a source language by matching the parallel sentences in a target language 
and the search unit, by means of the connected computer, retrieving a natural sentence 
containing the object phrases to be rendered in the original text in a source language from 
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among multiple natural sentences in a source language stored in the memory, searching, 
regarding the natural sentence extracted through the process by the search unit, to see 
whether the specific anterior words non-retrieved and with minimal distance fi-om the object 
phrases to be rendered are aligned on the firont side of the object phrases to be rendered in the 
natural sentence or of the anterior words extracted in the preceding search process within the 
prescribed number of words from among the first anterior group of words located on the fi-ont 
side fi-om the object phrases to be rendered in the original text, repeating the process of 
counting up the nimiber of words by the first distance regarding the fi-ont side corresponding 
to the distance fi-om the object phrases to be rendered when the specific anterior words are 
detected or the fi-om anterior words extracted in the preceding retrieval process till the non- 
retrieved anterior words have ceased to exist in the first anterior group of words, and 
concomitantly, searching to see whether the specific posterior words non-retrieved and with 
minimal distance from the object phrases to be rendered are aligned on the back side of the 
object phrases to be rendered in the natural sentence or of the posterior words extracted in the 
preceding search process within the prescribed number of words fi-om among the first 
posterior group of words located on the back side firom the object phrases to be rendered in 
the original text, repeating the process of coimting up the number of words by the first 
distance regarding the back side corresponding to the distance fi-om the object phrases to be 
rendered when the specific posterior words are detected or the fi-om posterior words extracted 
in the preceding retrieval process till the non-retrieved posterior words have ceased to exist in 
the first posterior group of words. 

The program relating to the invention according to the nineteenth embodiment is a program 
for allowing a computer connected to the abovementioned storage means to fimction as the 
abovementioned retrieval means and translation identifying means. Therefore, when the 
abovementioned computer runs the program relating to the invention according to the 
nineteenth embodiment, the computer fiinctions as the translation identifying means 
according to the eighth embodiment. Therefore, in the same manner as the invention 
according to the eighth embodiment, the suitable translations of the phrases to be translated in 
the original sentence (the translation having high probability of obtaining from the original 
sentence a natural translation as a sentence in the target language) can be obtained. 
Effect of Livention 

[0040] As described above, this invention retrieves natural sentences containing phrases to 
be translated in the original sentence in the source language from natural sentences in the 
source language composed of a pliurality of words stored in the storage means by correlating 
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the natural sentences with the translations of the target language, searches the natural 
sentences extracted by the retrieval for the phrases to be translated and the frequently 
appearing phrases that frequently appear in the same sentences of the source language, 
searches translated sentences of natural sentences containing each of the phrase to be 
translated and specific frequently appearing phrases for frequently appearing translations of 
phrase to be translated by referring to the specific frequently appearing phrases found by the 
search means and existing in the original sentences and referring to translated sentences of 
natural sentences containing each of the phrase to be translated and identifies found 
frequently appearing translations as translations of the phrases to be translated in the original 
sentences. Therefore, this invention has an excellent effect wherein the suitable translations of 
the phrases to be translated in the original sentence can be obtained. 

[0041] Moreover this invention retrieves natural sentences containing phrases to be translated 
in the original sentence in the source language from natural sentences in the source language 
composed of a plurality of words stored in the storage means by correlating the natural 
sentences with the translations of the target language, identifies altemative phrases existing in 
original sentences and replaceable with targeted phrases not contained in natural sentences 
extracted by retrieval of said retrieval means, and identifies translations of at least a phrase to 
be translated in translated sentences of natural sentences containing each of altemative phrase 
identified by said identifying means and phrase to be translated among natural sentences 
extracted by retrieval of said retrieval means, as translations of at least a phrase to be 
translated in original sentences. Therefore, this invention has an excellent effect wherein the 
suitable translations of the phrases to be translated in the original sentence can be obtained. 
[0042] Furthermore, this invention has an excellent advantageous effect capable of obtaining 
the most appropriate parallel version of the object phrases to be rendered in the original text 
since the parallel translation of the object phrases to be rendered at least in the parallel 
version of the natural sentence selected based on the total value of the first evaluated value 
computed is identified as at least the parallel version of the object phrases to be rendered in 
the original text, searching the natural sentence containing the object phrases to be rendered 
in the original text in a source language from out of the natural sentences in a source 
language respectively composed of multiple words stored in muUiples in memory in response 
to the parallel sentence in a target language, respectively granting the first evaluated value 
according to the small interval in the original text between the object phrases to be rendered 
in the original text for the respective words in the original text, identifying the concordant 
words with the original text from among the respective words of the natwal sentence in 
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question regarding the natural sentence extracted by the search process, thus, computing by 
the natural sentence the total value of the first evaluated value granted to the corresponding 
words identified. 

[0043] Additionally, this invention has an excellent advantageous eflFect capable of obtaining 
the most appropriate parallel version of the object phrases to be rendered in the original text 
since the parallel translation of the object phrases to be rendered at least in the parallel 
version of the natural sentence selected based on the total value of the third evaluated value 
computed is identified as at least the parallel version of the object phrases to be rendered in 
the original text, searching the natural sentence containing the object phrases to be rendered 
in the original text in a source language from out of the natural sentences in a source 
language respectively composed of multiple words stored in multiples in memory in response 
to the parallel sentence in a target language, searching, to the natural sentence extracted by 
the search process, from among the anterior word groups aligned toward the front side from 
the object phrases to be rendered in the original text, whether the specific anterior words non- 
retrieved and with minimum distance from the object phrases to be rendered are located 
toward the front of the object phrases to be rendered in the natural sentence or the anterior 
words extracted in the preceding search within the prescribed number of words, repeatedly 
assigning the third evaluated value according to the minimal distance between the specific 
anterior words upon detection and the object phrases to be rendered or the anterior words 
extracted in the preceding search imtil the non-retrieved anterior words have ceased to exist 
in the aforementioned original text, and concomitantly searching, to the natural sentence 
extracted by the search process, from among the posterior word groups aligned toward the 
back from the object phrases to be rendered in the original text, whether the specific posterior 
words non-retrieved and with minimum distance from the object phrases to be rendered are 
located toward the back of the object phrases to be rendered in the natural sentence or the 
posterior words extracted in the preceding search within the prescribed number of words, 
repeatedly assigning the third evaluated value according to the minimal distance between the 
specific posterior words upon detection and the object phrases to be rendered or the posterior 
words extracted in the preceding search until the non-retrieved posterior words have ceased 
to exist in the aforementioned original text, thus, computing the total value of the third 
evaluated value by the natural sentence. 

[0044] Additionally, this invention has an excellent advantageous effect capable of 
obtaining the most appropriate parallel version of the object phrases to be rendered in the 
original text since the parallel translation of the object phrases to be rendered at least in the 
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parallel version of the natural sentence selected based on the integrated recognition computed 
from the number of words by the first distance regarding the front side and the nvunber of 
words by the first distance regarding the back side is identified as at least the parallel version 
of the object phrases to be rendered in the original text, searching the natural sentence 
containing the object phrases to be rendered in the original text in a source language from out 
of the natural sentences in a source language respectively composed of multiple words stored 
in multiples in memory in response to the parallel sentence in a target language, exploring 
whether the specific front words non-retrieved and with minimal distance from the object 
phrases to be rendered among the first anterior group of words existing to the front to the 
object phrases to be rendered in the original text for the natural sentence extracted in the 
search in question are located on the front of the object phrases to be rendered in the natural 
sentence or anterior group of words extracted in the preceding search and within the 
prescribed nimiber of words, repeatedly counting up the number of words by the first distance 
regarding the front side corresponding to the interval from the object phrases to be rendered 
or from the front words extracted in the preceding search when specific anterior words have 
been detected until the non-retrieved front words have ceased to exist in the first anterior 
group of words, and concomitantly, exploring whether the specific back words non-retrieved 
and with minimal distance from the object phrases to be rendered among the first posterior 
group of words existing to the back to the object phrases to be rendered in the original text for 
the natural sentence extracted in the search in question are located on the back of the object 
phrases to be rendered in the natural sentence or posterior group of words extracted in the 
preceding search and within the prescribed number of words, repeatedly counting up the 
number of words by the first distance regarding the back side corresponding to the interval 
from the object phrases to be rendered or from the back words extracted in the preceding 
search when specific posterior words have been detected until the non-retrieved back words 
have ceased to exist in the first posterior group of words. 
Brief description of the drawings 

[0045]Figure 1 is a block schematic diagram showing a configuration of a personal computer 
related to the preferred embodiment. 

Figure 2 is a flow chart showing the contents of the translation identifying processing. 
Figure 3 is a table showing an example of natural sentences and the corresponding 
translations including a word "operation" registered in a translation database. 
Figure 4 is a block diagram for describing a mode where this invention is applied to a 
computer system to which a client personal computer and a server are connected through a 
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network. 

Figure 5 is a block schematic diagram showing a configuration of a machine translation 
system to which this invention is applied. 

The Best Mode of Embodiment of the Invention 

[0046] Now one of the preferred embodiments of this invention is described in detail and 
referenced by drawings. 

First, a mode where this invention is applied to an electronic dictionary system that provides 
a user with a translation of a specified phrase to be translated when the user specifies a phrase 
to be translated is described. 

[0047] Figure 1 shows a personal computer (PC) 10 capable of functioning as the 
abovementioned electronic dictionary. 

PC 10 comprises a CPU lOA, a ROM lOB, a RAM IOC and an I/O port lOD, which are 
connected with each other through a bus lOE composed of a data bus, an address bus, a 
control bus, etc., to configure the PC. 

A display 12 composed of a CRT, an LCD, etc., a keyboard 14 through which a user enters 
data, etc., a mouse 16, a hard disk drive (HDD) 18, a CD-ROM drive 20 that reads data fi-om 
CD-ROM 24, and a scanner 22 capable of reading paper data are connected to the I/O port 
lOD as various I/O devices. 

[0048] A translation-identifying program (corresponding to programs utilized in) the fifteenth 
and sixteenth embodiments for allowing PC 10 to fimction as an electronic dictionary system 
is installed in HDD 18 of PC 10. A corresponding translation database (corresponding 
translation database) storing data that the aforementioned translation-identifying program 
uses for identifying a suitable translation is stored in this HDD 18. 

There are several methods for installing (transferring) the translation-identifying program to a 
PC 10. For example, the translation-identifying program is installed by recording the 
translation-identifying program along with a setup program in CD-ROM 24 in advance, 
inserting the CD-ROM 24 into CD-ROM drive 20, and commanding CPU lOA to run the 
aforementioned setup program to read the translation-identifying program from CD-ROM 24 
in sequence and to write the read translation-identifying program in HDD 1 8 in sequence. 
[0049] The corresponding translation database can be stored in HDD 18 by, for example, 
recording the corresponding translation database in CD-ROM 24 in advance, and configuring 
the setup program so that the corresponding translation database is written in HDD 18 
simultaneously when installing the translation-identifying program. 
A large amount of text data of natural sentences composed of a plurality of words and 
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described in the source language (either one of sentences, clauses, phrases, stereotyped 
expressions and collocations, which are not subjected to editing or processing such as 
division by word, extraction of multivocal words, etc., of which specific examples to be 
described later) having corresponding text data of translations described in a target language 
(these translations are also natural sentences not subjected to editing or processing such as 
division by word, extraction of multivocal words, etc.) are registered in the corresponding 
translation database. In this embodiment, HDD 18 storing the corresponding translation 
database corresponds to a storage means relating to this invention. 
[0050] It is possible to use this corresponding translation database by recording the 
corresponding translation database on a recording medium, such as CD-ROM 24 and a DVD- 
ROM, and reading the corresponding translation database directly fi*om this recording 
medium. Thus in this embodiment, the medium storing the corresponding translation database 
functions as the storage means relating to this invention. 

Moreover, words in a source language and their corresponding translations in the target 
language are registered in the corresponding translation database relating to this embodiment. 
The corresponding translation database relating to this embodiment can be created by adding 
natural sentences and their translations as required to an existing dictionary containing, for 
example, words in the source language and their corresponding translations. 
[0051] Now an effect of this embodiment is described. 

In this embodiment, when existence of a phrase (a phrase to be translated that may be a word 
or a phrase composed of a plurality of continuously arranged words in the original sentence), 
of which translation in the target language is desired, is found in the original sentence 
described in the source language, a user performs a specified operation for commanding an 
output of the translation of a phrase to be translated in the original sentence through PC 10. 
The texts read in PC 10 as text data (for example text that a user entered through keyboard 14, 
text created using word processing software, text on the Web page viewed through the 
Internet, text obtained by reading character data by a scanner, and processing the data by 
OCR (Optical Character Recognition) system, etc.) can be used as the original sentence. 
[0052] As the abovementioned operation, the operation, such as selecting an item 
corresponding to "corresponding translation output" in a context menu that is displayed by 
right-clicking a phrase to be translated, can be used after reversely displaying the phrase to be 
translated by selecting the phrase to be translated in the condition that, for example, the 
original sentence is displayed on display 12. 

It is also possible to specify the output of a translation of a phrase to be translated by 
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performing an operation such as selecting the required icon displayed in, for example, the 
toolbar instead of using the context menu as described above. 

When the abovementioned operation for specifying the output of the translation of the phrase 
to be translated is performed, CPU 1 OA of PC 10 runs the translation-identifying program to 
perform the translation identifying processing shown in Figure 2. 
This translation identifying process is the process for which the parallel translation 
methodology as embodied in the tenth and eleventh embodiments was applied through the 
implementation of which process, PC 10 functions as an electronic dictionary system (the 
translation-identifying system embodied in the first and fourteenth embodiments). 
The original sentence to be processed may be a sentence containing the phrase to be 
translated or may be either one of a clause, a phrase, a stereotyped expression, or a 
collocation containing the phrase to be translated. When either one of a clause, a phrase, a 
stereotyped expression, or collocation is assxmied to be an original sentence to be processed, 
it is possible to allow the clause, phrase, stereotyped expression, or collocation as the original 
sentence to be processed to be specified by a user or to be automatically determined through 
the translation identifying processing. 

[0054] In the next Step 102, a natural sentence completely coinciding with the original 
sentence to be processed is retrieved fi'om natural sentences registered in the corresponding 
translation database using text data of the original sentence to be processed acquired in Step 
100 as a key, and a natural sentence containing the phrase to be translated (a natural sentence 
partially coinciding with the original sentence because at least the phrase to be translated is 
contained) is retrieved from the natiu*al sentences registered in the corresponding translation 
database using text data of the phrase to be translated as a key. 

When the phrase to be translated is composed of a plurality of words, natural sentences 
containing at least one of a plurality of words composing the phrase to be translated is 
retrieved at the same time. 

Step 102 corresponds to the retrieval means of the first, second, fifteenth, and sixteenth 
embodiments and also correspond to the first step of the tenth and eleventh embodiments. 
Step 104 through Step 112 corresponds to the first translation identifying means of the fourth 
embodiment. 

If the answer is positive, the program moves to Step 106 to read the corresponding 
translations in the target language correlated with a natural sentence completely coinciding 
with the original sentence to be processed and registered in the corresponding translation 
database and to display the corresponding translations on display 12 to end the processing. 
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When displaying these corresponding translations, the read translation is searched for the 
translation of the phrase to be translated and the translation of the found phrase to be 
translated is highlighted. 

hi this case a user can recognize the suitable translation of the specified phrase to be 
translated (the translation having high probability of obtaining from the original sentence a 
natural translation as a sentence in the target language) and can recognize the suitable 
translation of the original sentence to be processed containing the specified phrases to be 
translated (translation sentences natural as sentences in the target language). 
The abovementioned Steps 104 and 106 correspond to the first translation identifying means 
described in Claim 2. 

[0056] (Example l)The abovementioned processing is fiirther described using an actual 
example. 

Assuming that a source language is English and a target language is Japanese, when "For 
your safety" contained in an original sentence to be processed "For yom safety, don't rush 
into the train" is specified as a phrase to be translated, the translation "^^" is selected 
against "safety" that composes the abovementioned phrase to be translated because the source 
language and the corresponding translation are registered word by word in a general 
translation process. 

However, in the translation identifying processing relating to this embodiment, since natural 
sentences in the source language and their corresponding translations in the target language 
are registered in the corresponding translation database, there is a possibility that natural 
sentences completely coinciding with the abovementioned original sentences are registered in 
the corresponding translation database. 

When natural sentences completely coinciding are registered in the corresponding translation 
database, the translations registered in the corresponding translation database correlated with 
the natural sentences are displayed with the translations of the phrases to be translated 
highHghted as shown below: 

hi the general translation process, it is difficuU to select "^aE^" as the corresponding 

translation of the word "safety" and only an unnatural translation such as "Anata no anzen no 
tame, sono ressha ni kakekomanaide kudasai." is obtained. On the other hand, this invention 
provides a translation natural as native language. 

[0057] When displaying the translations in Step 106, it is possible to allow natural sentences 
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partially coinciding with the original sentence to be processed extracted by the retrieval in 
Step 102 to be displayed in addition to the natural sentences completely coinciding with the 
original sentence to be processed. 

A plurality of the natural sentences partially coinciding with the original sentence to be 
processed is generally extracted by the retrieval. When displaying these, it is preferable to 
calculate a degree of coincidence to be described below for individual natural sentences 
partially coinciding with the original sentences to be processed in order to display the 
translation of each natural sentence on display 12 in the descending order of the degree of 
coincidence. 

[0058] On the other hand, when the natural sentences completely coinciding with the original 
sentences to be processed are not foimd in the corresponding translation database by the 
retrieval in Step 102, the answer of the question in Step 104 is negative. Then the program 
moves to Step 108 to count the ntimber of words coinciding with either one of words in the 
original sentence (number of coinciding words) among the words in the natural sentence as 
for natural sentences extracted from the corresponding translation by the retrieval in Step 102 
(natural sentences partially coinciding with the original sentences to be processed) and to 
calculate the degree of coincidence with the original sentence based on the results of cotmting 
the number of coinciding words. 

[0059] In this embodiment, when installing the translation-identifying program, a frequently 
appearing word table in which the words frequently appearing in natural sentences in the 
source language (for example, "a," "the," "to," and "in" in natural English sentences) is also 
stored on HDD 18. 

The number of coinciding words is counted in Step 108 while referring to the frequently 
appearing word table to exclude the words registered in the frequently appearing word table 
from the number of coinciding words to be counted. 

Thus, the influence of the frequently appearing words on the number of coinciding words can 
be avoided. 

[0060] In this embodiment, when installing the translation-identifying program, an inflected 
word table where words having different endings due to the difference of singular/plural or 
tense are registered is also stored in HDD 18. 

When the number of coinciding words is counted in Step 108 and the words having only 
mismatching endings appear, whether the reason for a mismatching ending exists in the 
difference of the singular/plural or in the difference of the tense is asked, and the words 
having different endings due to the difference of the singular/plural or the tense are coimted 
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assumed as the coinciding words. 

Instead of this, the system in which only irregular inflections are registered in the inflection 
word table and regular inflections (for example, "s" for plural form and "ed" for past tense) 
are automatically assumed to be the coinciding words is acceptable. 
Thus, the influence of the words having different endings due to the singular/plural or the 
tense (words that should be fundamentally recognized as the coinciding words) on the 
number of coinciding words can be avoided. 

[0061] In this embodiment, a word once counted as the coinciding words for the reason that 
the word coincides with a certain word in the original sentence during counting of the number 
of coinciding words in Step 108 are not counted twice even if the word reappears in a natural 
sentence, in order to avoid repetitively counting the coinciding words that appeared twice or 
more. 

Thus even if the same coinciding word exists in multiple places of the natural sentence, the 
influence of this coinciding word on the number of coinciding words can be avoided. 

[0062] A formula for calculating the degree of coincidence can be defined so that the degree 
of coincidence increases as the number of coinciding words increases. For example, a 
calculation formula for normalizing the number of coinciding words based on the nimiber of 
words composing the phrases to be translated (degree of coincidence = number of coinciding 
words divided by number of words composing phrases to be translated) can be used. 

[0063] In the next Step 110, the degrees of coincidence calculated in each natural formula 
are compared to ask whether a plurality of natural sentences having the maximum degree of 
coincidence exists or not. 

If the answer is negative, the program moves to Step 112 to read the corresponding 
translation sentence in the target language registered in the corresponding translation database 
having a corresponding natural sentence of the maximum degree of coincidence, to search the 
read translation sentence for the translation of the phrase to be translated, to display the read 
translation sentence on display 12 with the translation of the found phrase to be translated 
highlighted, and to end the processing. 

In this case, the user can recognize a suitable translation of the specified phrase to be 
translated (the translation having high probability of obtaining fi-om the original sentence a 
natural translation as a sentence in the target language). 

[0064] In Step 112, in addition to the natural sentences having the maximum degree of 
coincidence among the natural sentences extracted by the retrieval in Step 102, the translated 
sentences of a plurality of natural sentences (a certain number of natural sentences in a 
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descending order of the degree of coincidence, or all the natural sentences of which degree of 
coincidence is higher than the specified value) are also read fi-om the corresponding 
translation database in a descending order of the degree of coincidence and displayed on 
display 12 at the same time. 

If a plurality of natural sentences having the same degree of coincidence exists, the nimiber of 
words mismatched with the original sentence in each natural sentence is coimted, the 
translated sentence corresponding to the natural sentence having the same degree of 
coincidence is displayed in ascending order (fi-om smaller) of the number of mismatched 
words in the corresponding natural sentences to display the translated sentence at the same 
time. 

[0065] (Example 2) The abovementioned processing is further described using an actual 
example. 

Assxmiing that a source language is English and a target language is Japanese, when a word 
"operation" contained in an original sentence to be processed, "The system suddenly came 
into operation," is specified and natural sentences (1) through (8) shown in Figure 3 are 
correlated with translated sentences (1) through (8) and registered in the corresponding 
translation database as natural sentences containing the phrase to be translated "operation," 
the natural sentences other than (3) shown in Figure 3 have the number of coinciding words 
with the original sentence of "1" with the degree of coincidence of 100% ("operations" in 
natural sentence (2) is also counted as a coinciding word as aforementioned) and natural 
sentence (3) has the number of words coinciding with the original sentence of "4" (the word 
*the" is excluded fi-om the coinciding words to be counted as a frequently appearing word) 
with the degree of coincidence of 400%. Therefore, a translated sentence (3) correlated with 
natural sentence (3) and registered in the corresponding translation database is displayed with 
the phrase to be translated highlighted as shown below: 

(Example 3) Assimiing that a soiu-ce language is English and a target language is Japanese, 
when a word "operation" contained in an original sentence to be processed, "A small business 
is exempt from the operation of the new law," is specified and natural sentences (1) through 
(8) shown in Figure 3 are provided with corresponding translated sentences (1) through (8) 
and registered in the corresponding translation database as natural sentences containing the 
phrase to be translated "operation," the natural sentences other than (2) have the number of 
words coinciding with the original sentence of "1" with the degree of coincidence of 100% 
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and natural sentence (2) has the number of words coinciding with the original sentence of "4" 
(the word "the" is excluded from the coinciding words to be counted as a frequently 
appearing word) with the degree of coincidence of 400%. Therefore, a translated sentence (2) 
correlated with natural sentence (2) and registered in the corresponding translation database is 
displayed with the phrase to be translated highlighted as shown below: 

The word "operation" in English sentences is a multivocal word for which a suitable 
translation is difficult to identify. As easily understood from the abovementioned Examples 2 
and 3, a suitable natural sentence (translated sentence) where the multivocal word is used in 
the same meaning as the original sentence can be obtained by registering translated sentences 
correlated with natural sentences in the corresponding translation database and selecting a 
natural sentence (translated sentence) based on the degree of coincidence. 
[0066] (Example 4) In the case of Chinese sentences, it is often difficult to identify the imit in 
which a translation should be obtained because it is difficult to distinguish a mass of meaning 
without knowing the Chinese language sufficiently. 

hi this translation identifying processing relating to this embodiment, in order to solve this 
problem, natural sentences containing the specified phases to be translated are retrieved from 
the natural sentences registered in the corresponding translation database (when the phrase to 
be translated is composed of a plurality of words, natural sentences containing at least one of 
a plurality words composing the phrase to be translated are also retrieved at the same time) 
and translated sentences having a high degree of coincidence with the entire original 
sentences (including the translated sentences of natural sentences other than natural sentences 
having the highest degree of coincidence) are displayed. Therefore, a user can recognize the 
unit in which translation should be obtained from the original sentences and their suitable 
translations at the same time by referring to the displayed translated sentences. 
[0067] For instance, here is a case of translating from a source language to a target language: 
an original Chinese expression "^M't'SI^" to a Japanese expression. When any one of the 
expressions of "^", "^M", "^Jg4^H"or"^il4ig|^» is specified as a target 

word or a phrase to be rendered, a Hst of the following natural expressions in parallel 
translation will be displayed in sequence as follows based on the coincidence. (Incidentally, 
the following concomitantly shows the coincidence when "^M^" H^" is assigned as an 
object phrase to be rendered but it goes without saying that the individual coincidence will 
vary depending on a word or phrases specified.) 
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1. "^i|i4^11^"="^Mit±ll" (Concordant number of words: 5; Coincidence: 100%) 

2. "^M* Sl"="4' ffl ^iUS $ -H: i> "(Concordant number of words: 4; Coincidence: 80%) 

3. "^® tfi"="SgM43<D"(Concordant number of words: 3; Coincidence: 60%) 

4. "^®"="^M"^ ^ s ^ 1± ^"(Concordant number of words: 2; Coincidence: 40%) 

5. "^"="^-^^^ ai"^"(Concordant number of words: 1; Coincidence: 20%) 

Thus, a user can recognize the luiit in which translation should be obtained from the original 
sentences (in this case "^M^iffl^") and their suitable translations at the same time by 
referring to the translated sentences listed up above. 

[0068] When both an original sentence and a phrase to be translated are each of the 
abovementioned natural sentences has the number of coinciding words "1 " and the degree of 
coincidence 100%. However, since the number of mismatched words is different from each 
other, original sentences are listed in the following order different from the case of the 
original sentence "^M^H^": 

1. (Number of mismatched words "0") 

2. "f§il" (Number of mismatched words "1") 

3. "^il4'" (Number of mismatched words "2") 

4. ":^J143a" (Number of mismatched words "3") 

5. "il® 4^11^" (Number of mismatched words "4") 

In the similar way, when both an original sentence and a phrase to be translated are "^®," 
original sentences are also hsted in the following order different from the case of the original 
sentence "^M^^ H^" or based on the degree of coincidence and the number of 

mismatched words: 

1. "^®" (Number of coinciding words "2," degree of coincidence 100% and number of 
mismatched words "0") 

2. (Number of coinciding words "2," degree of coincidence 100% and nimiber of 
mismatched words "1") 

3. "f§® ff>g|" (Nxunber of coinciding words "2," degree of coincidence 100% and nxmiber of 
mismatched words "2") 

4. "'ftM^MM'' (Number of coinciding words "2," degree of coincidence 100% and number 
of mismatched words "3") 

5. (Number of coinciding words "1," degree of coincidence 50% and number of 
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mismatched words "0") 

Even when the phrase to be translated is the same as this, translated sentences are listed in a 
suitable order according to the natural sentences as shown above if the original sentence is 
different. Therefore, a user can recognize a suitable translation corresponding to the original 
sentence by referring to the listed results of the translated sentences. 
[0069] When a pliu-ality of natural sentences having the maximum degree of coincidence 
exists, the answer to the question in Step 1 10 is positive. Then the program moves to Step 1 14 
to perform a correlation analysis for selecting a natural sentence based on the phrases to be 
translated and the words that appear frequently in the same sentences of the source language 
(words having a high correlation with the phrase to be translated: corresponding to the 
frequently appearing phrases of the first, tenth, and fifteenth embodiments) in Steps 114 and 
116. 

[0070] That is to say, in Step 114, whether the phrase to be translated and the words 
frequently appearing in the same sentence of the source language (words having a high 
correlation with the phrase to be translated) are contained in the original sentence to be 
processed or not is investigated by investigating whether each word other than the phrase to 
be translated in the original sentence to be processed (however, the frequently appearing 
words registered in the frequently appearing word table are excluded) is contained in each 
natural sentence extracted from the corresponding translation database by the retrieval in Step 
102 or not. 

In this processing, for example, the words having the nimiber of times of appearing in each 

natural sentence greater than the specified number of times (once, twice or more) among each 

word other than the phrases to be translated in the original sentences to be processed can be 

identified as the word having a high correlation to the phrase to be trsmslated. 

The abovementioned Step 114 cop-esponds to the search means described in the first and 

fifteenth embodiments and corresponds to the second Step the tenth embodiment. 

[0071] In the next Step 116, whether the words having a high correlation with the phrase to 

be translated are found by the retrieval in Step 1 14 or not is asked. 

When the answer is positive, the program moves to Step 1 1 8 to refer to the translated 

sentences of the phrase to be translated and the natural sentences each of which contains the 

words having a high correlation with the phrase to be translated determined in Step 114 

(words existing in the original sentence to be processed) among each natural sentence 

extracted from the corresponding translation database by the retrieval in Step 102, and to 

identify the frequently appearing translation of the phrase to be translated in the 
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abovementioned natural sentences. 

Specifically speaking for the frequently appearing translation of the phrase to be translated, 
when the phrase to be translated and a single natural sentence is extracted as a natural 
sentence containing each of the words having a high correlation with the phrase to be 
translated, the translation of the phrase to be translated in the natural sentences is identified as 
the frequently appearing translation. When a plurality of natural sentences is extracted as the 
abovementioned natural sentences, the translation of the most frequently appearing in the 
translations of the phrases to be translated in these natural sentences are identified as the 
frequently appeetring translations. 

[0072] As mentioned above, when natural sentences containing the phrases to be translated 
and containing words identical to the original sentence to be processed as the words having a 
high correlation with the phrases to be translated exist, there is a high probability that the 
natural sentence is a sentence that uses the phrase to be translated as a sentence having the 
same meaning as the original sentence. There is also a possibility that a natural sentence 
having different translations of the phrase to be translated is mixed in the abovementioned 
natural sentence. 

On the other hand, in Step 118, the frequently appearing translations of the phrases to be 
translated in the abovementioned natural sentences are identified by referring to the translated 
sentences of natural sentences containing each of the phrases to be translated and the words 
having a high correlation with the phrase to be translated. Therefore, the suitable translations 
of the phrases to be translated in the original sentences to be processed can be obtained. 
[0073] In Step 119, the translated sentences in the target language registered in the 
corresponding translation database correlated with the natural sentences in which the phrases 
to be translated are correlated with frequently appearing translations found in Step 1 1 8 are 
read from the natural sentences containing each of the phrases to be translated and the words 
having a high correlation with the phrases to be translated, and the read translated sentences 
are displayed on display 12 to end the processing with the translations of the phrases to be 
translated and the words having a high correlation with the phrases to be translated 
highlighted on the read translated sentences. 

Also, in this case, a user can recognize a suitable translation of the specified phrase to be 
translated (the translation having high probability of obtaining from the original sentence a 
natural translation as a sentence in the target language). 

The abovementioned Steps 116 through 119 correspond to the first translation identifying 
means of the first and fifteenth embodiment, and the third Step of the tenth embodiment. 
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respectively. 

[0074] Also in Step 1 19, in addition to the natural sentences containing each of the phrases to 
be translated and the words having a high correlation with the phrases to be translated and 
where the phrases to be translated are correlated with the frequently appearing translations 
among the natural sentences extracted by the retrieval in Step 102, the translations of a 
plurality of natural sentences (a certain number of natural sentences in the descending order 
of the degree of coincidence, or all the sentences having the degree of coincidence higher 
than the specified value) are read from the corresponding translation database in descending 
order of the degree of coincidence and simultaneously listed on display 12 in an order 
according to the degree of coincidence or the number of mismatched words. 
[0075] (Example 5) The abovementioned processing is further described using an actual 
example. Assuming that an original language is English and an object language is Japanese, 
when a word "operation" contained in the original sentence to be processed, "I had an 
operation to remove a rectal cancer," is specified and the natural sentences (1) through (8) 
shown in Figure 3 and other natural sentences are registered in the corresponding translation 
database as the natural sentences containing the phrase to be translated "operation," it is 
diflBcuh to select a most suitable natural sentence (translated sentence) only from the degree 
of coincidence because both the natural sentences (6) and (8) have the number of coinciding 
words of "2." 

When one of the key words other than the phrase to be translated "operation" in the original 
sentence to be processed (for example "remove," "rectal" or "cancer") is retrieved from the 
natural sentences (1) through (8), "cancer" is extracted as a word having a high correlation 
with the phrase to be translated "operation" in the original sentence to be processed because 
the natural sentences (6) and (8) contain "cancer." 

[0076] There is a high probability that the natural sentences containing the phrase to be 
translated "operation" and the word "cancer" having a high correlation are the sentences that 
use the phrase to be translated "operation" in the same meaning as the original sentence to be 
processed. However, there is also a possibility that the natural sentences in which the 
translations of the phrase to be translated differ from the phrase to be translated are mixed in 
the natural sentences that are registered in the corresponding translation database and that 
satisfies the abovementioned condition. 

In this example, the natural sentence (8) corresponds to this case. 

Although the natural sentence (8) contains the phrase to be translated "operation" and the 
word "cancer" having a high correlation, the natural sentence (8) differs from the original 
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sentence to be processed because the translation of the phrase to be translated "operation" is 

[0077] In order to prevent incorrect identification of the translation of the phrase to be 
translated due to the influence of these natural sentences, the translated sentences of the 
natural sentences containing the phrase to be translated "operation" and the word "cancer" 
having a high correlation are referred to. 

The phrase to be translated "operation" and the word "cancer" having a high correlation is 
contained in a natural sentence other than natural sentence (6) and a natural sentence in which 
the phrase to be translated "operation" is correlated with the translation "^$ff" is registered 
in the corresponding translation database. Therefore, the frequently appearing translation of 
the phrase to be translated "operation" in the natural sentences containing the phrase to be 
translated "operation" and the word "cancer" having a high correlation is determined to be 

the natural sentence (6) that contains the phrase to be translated "operation" and the 
word "cancer" having a high correlation and in which the phrase to be translated is correlated 
with the determined frequently appearing corresponding translation is selected, and the 
translated sentence (6) correlated with the natural sentence (6) and registered in the 
corresponding translation database is displayed with the translation of the phrase to be 
translated and the word having a high correlation highlighted as shown below. 

"V3>$^(Z)^^B^. ^i&omm^^fzr 

Therefore, also in this example, a suitable natural sentence (translated sentence) that uses the 
phrase to be translated "operation" in the same meaning as the original sentence is selected. 
[0078] hi Figure 3, although the number of natural sentences containing "operation" is 
restricted to simpHfy the description, more natural sentences are actually registered. When the 
original sentence to be processed is, for example, a sentence using the phrase to be translated 
"operation" in the meaning of "^^Rf," the word such as "injury" or "hospital" is extracted as 
the word having a high correlation depending on the content of the original sentence to be 
processed. When the original sentence to be processed is, for example, a sentence using the 
phrase to be translated "operation" in the meaning of "^flll," the word such as "system" or 
"computer" is extracted as the word having a high correlation depending on the content of the 
original sentence to be processed. When the original sentence to be processed is, for example, 
a sentence using the phrase to be translated "operation" in the meaning of "JSffl," the word 
such as "law" is extracted as the word having a high correlation depending on the content of 
the original sentence to be processed. 
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[0079] Meanwhile, in case words with higher correlation with the object phrase to be 
rendered have not been extracted, the determination in Step 1 16 is negated, moving to Step 
120 and further to 120 and 134 during the interval of which steps a scheme analysis is 
implemented selecting natural sentences based on the alternative words, i.e., equivalent to the 
alternative phrases of the second, third, eleventh, and sixteenth embodiment, that can replace 
with focus words other than the target phrases to be rendered in the original text to be 
processed. 

[0080] That is to say, the word existing in the original sentence to be processed in Step 120 
and not existing in each natural sentence extracted from the corresponding translation 
database by the retrieval in Step 102 (targeted word) is identified. 

In Step 122, a natural sentence containing the targeted word is retrieved from the natural 
sentences registered in the corresponding translation database. 

When a plurality of the targeted words exists, the retrieval in Step 122 is performed for each 
word. 

In Step 124, a retrieval condition for retrieving each of the natural sentences ("natural 
sentence having the same sentence structure as the natural sentence extracted by the retrieval 
of natural sentences containing the targeted phase" of the third embodiment) only of which 
targeted word in each natural sentence extracted by the retrieval in Step 122 differs is defined 
to retrieve the natural sentence from the natural sentences registered in the corresponding 
translation database based on the defitned retrieval condition. 

0081] In the next Step 126, whether the matching natural sentence is extracted by the 
retrieval in Step 124 or not is asked. 

When each of the same translated sentences exists in the corresponding translation database 
only with the targeted word replaced with the other word, 

there is a high probability that the aforementioned other word is an alternative word that can 

be used instead of the targeted word. Therefore, when the answer is positive, the program 

moves to Step 128 to recognize the word replaced with the targeted word in the natural 

sentences extracted by the retrieval as the alternative word for the targeted word. 

The abovementioned Steps 120 through 130 correspond to the identifying tool of the second 

embodiment (see the thirteenth embodiment for details), the identifying tool of the sixteenth 

embodiment, and the second step described in the eleventh embodiment. 

[0082] In Step 130, whether a natural sentence in which the targeted word is replaced with 

the alternative word exists in a plurality of natural sentences having the maximum degree of 

coincidence (this may be substituted by "degree of coincidence more than the specified 
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value") or not is asked. 

When a natural sentence in which targeted word is replaced with the alternative word exists 
in a plurality of natural sentences having the maximum degree of coincidence, it is 
determined that there is a high probability that the natural sentence concemed is a sentence 
that uses the phrase to be translated in the same meaning as the original sentence to be 
processed. 

Therefore, if the answer is positive, the program moves to Step 132 to read the translated 
sentence of the target language registered in the corresponding translation database correlated 
with the natural sentence having the maximum degree of coincidence (this may be substituted 
by "degree of coincidence more than the specified value") and containing the altemative 
word instead of the targeted word, to search the read translated sentence for the translations 
of the phrase to be translated and of the altemative word, to display the read translated 
sentence on display 12 with the translation of the recognized phrase to be translated 
highlighted and with the translation of the altemative word marked so as to be recognized 
also as an altemative word, and to end the operation. 

Also in this case, a user can recognize a suitable translation of the specified phrase to be 
translated (the translation having high probability of obtaining fi-om the original sentence a 
natural translation as a sentence in the target language). 

The abovementioned Steps 130 and 132 correspond to the first translation identifying tool of 
the second and sixteenth embodiment, and the third step of the eleventh embodiment. 
[0083] Also in Step 132, in addition to the natural sentences having the maximum degree of 
coincidence and containing the altemative word instead of the targeted word among the 
natural sentences extracted by the retrieval in Step 102, the translated sentences of a plurality . 
of natural sentences (a certain number of natural sentences in the descending order of the 
degree of coincidence, or all the natural sentences having a degree of coincidence more than 
the specified value) are read in the descending order of the degree of coincidence and listed 
on display 12 in the order according to the degree of coincidence and the number of 
mismatched words at the same time. 

[0084] (Example 6) The abovementioned processing is further described using an actual 
example. 

Assuming that a source language is English and a target language is Japanese, when a word 
"have" is specified as the phrase to be translated in the original sentence to be processed 
containing "have lunch," a natural sentence containing "have breakfast" (a natural sentence 
that uses the phrase to be translated "have" in the same meaning as the original sentence to be 
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processed) is registered in the corresponding translation database, a natural sentence 
containing "havie lunch" is not registered, and an appropriate translated sentence cannot be 
identified even by investigation based on the degree of coincidence or a correlation analysis, 
the phrase to be translated in the original sentence to be processed ("have") or each word 
other than the words that are determined to have high correlation with the phrase to be 
translated (for example, "lunch") are assumed to be the targeted words, and the natural 
sentences containing each targeted word are retrieved. 

Thus, as for the targeted word "lunch," the natural sentences containing, for example, "eat 
lunch" or "take a late lunch" are extracted. 

[0085] Then the natural sentences in which only the targeted word in each natural sentence 
differs are retrieved fi*om each of the natural sentences extracted by the aforementioned 
retrieval. 

Thus as for the natural sentence extracted as a natural sentence containing, for example, the 
targeted word "lunch," natural sentences containing, for example, "eat breakfast" or "take a 
late breakfast" are extracted as the natural sentences in which only the targeted word differs. 
Therefore, since the word "breakfast" can be judged as an alternative word highly 
interchangeable with the focus word "lunch", from among the natural sentences extracted by 
the initial search (retrieval of natural sentences containing the object phrase for rendition 
"have"), the parallel translation ":^^-5" for the object phrase for rendition "have" is 
emphatically displayed, and concomitantly, the parallel version, i.e., rendition for the 
altemative word "breakfast," which is "^:fe" registered in the parallel translation DB in the 
form corresponding to the natural sentence, i.e., natural sentence containing the phrase "have 
breakfast" in which the altemative word "breakfast" is replaced with the focus word "lunch" 
is markedly output. 

Therefore, also in this example, a suitable natural sentence (translated sentence) using the 
phrase to be translated "have" in the same meaning as the original sentence to be processed is 
selected. 

[0086] Moreover, in similar cases such as even the judgment based on coincidence and the 
correlation analysis could not specify an adequate parallel rendition, the "have" in the 
original text containing "have cats" to be rendered was assigned as a target phrase for 
rendition but despite the natural sentences containing "have dogs" in the DB corpus, natural 
sentences containing "have cats" have not been entered, and also, when an adequate parallel 
rendition could not be specified even by the judgment based on coincidence and the 
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correlation analysis, such words as "cats" are regarded as a focus word, exactly in the same . 
way as above, since the word "dogs" can be judged as an alternative word highly 
interchangeable with the focus word "cats", from among the natural sentences extracted by 
the initial search (retrieval of natural sentences containing the object phrase for rendition 
"have"), the parallel translation "^ o " for the object phrase for rendition "have" is 
emphatically displayed, and concomitantly, the parallel version, i.e., rendition for the 
alternative word "dogs," which is "^" registered in the parallel translation DB in the form 
corresponding to the natural sentence, i.e., natural sentence containing the phrase "have dogs" 
in which the alternative word "dogs" is replaced with the focus word "cats" is markedly 
output. 

Therefore, also in this example, a suitable natural sentence (translated sentence) using the 
phrase to be translated "have" in the same meaning as the original sentence to be processed is 
selected. 

[0087] (Example 7) Assxmiing that a source language is Japanese and a target language is 
English, when "A^I'J'^" in an original sentence to be processed "fc^^'A^f'f ^" is specified 
as the phrase to be translated, a natural sentence "tK ^ti^if^ " (natural sentence using the 
phrase to be translated "^^ l"t in the same meaning as the original sentence to be 
processed) is registered in the corresponding translation database, but "fc^^A'^l't-S" is not 
registered, and an appropriate translated sentence cannot be identified even by the 
investigation based on the degree of coincidence or the correlation analysis because other 
natural sentences, such as"^i5 A^lt-S," are registered, "fc^" in the original sentence to 
be processed is assimied to be the targeted word and the natural sentences containing the 
targeted word are retrieved. 

Thus, the natural sentences such as "fc^ICO If "^3^^:^/)^"^ "fc^^SSf "*J 

"fc^A^ t> §1 ^±lf -5" and "fc^T*;Sto -5" are extracted. 
[0088] Then the natural sentences in which only the targeted word in each natural sentence 
differs are retrieved from each of the abovementioned extracted natural sentences. 
As the results of this retrieval, the following words are obtained as the words replaceable with 
the targeted word "fc^" 
"~ ni tsukeru" 

. . . " V — X "" L J: "5 ""7K'"'X —V^—T. ""M" 

"--;hA^-ro " 
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. .None 
''-A^t>?l^±lf^o " 

Among the abovementioned words, the word "tK" frequently appears and the natural 
sentence "7X ^ It -5" exists. 

Therefore the word "tK" can be determined as the alternative word having a high probability 
of being replaceable with the targeted word "fc^" in the original sentence to be processed 
"fc^ ("t ." Thus the translation correlated with the natural sentence "tK in 
which the targeted word "fc^" is replaced with the altemative word "tK," and registered in 
the corresponding translation database among the natural sentences extracted by the first 
retrieval (retrieval of the natiu-al sentences containing the phrase to be translated "A^ft^") is 
displayed with the translation ("pour") of the phrase to be translated ":6^ If highlighted 
and the translation (*^vater") of the altemative word marked as shown below, 
"pour *water* into ~" 

In the abovementioned translated sentence, . shows that the word ". . ." is an altemative 
word. 

Therefore, also in this example, a suitable natural sentence (translated sentence) that uses the 
phrase to be translated "A^lf ^" in the same meaning of the original sentence to be 
processed is selected. 

[0089] In the similar way, when "A^ If in an original sentence to be processed "3 ff^ If 
-S" is specified as a phrase to be translated, a natural sentence "4 A^Cf ^" (natural 
sentence using the phrase to be translated "A^ If ^" in the same meaning as the original 
sentence to be processed) is registered in the corresponding translation database, but "3 A^ 
If § " is not registered, and an appropriate translated sentence cannot be identified even by 
investigation based on the degree of coincidence or a correlation analysis, the translation 
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correlated with the natural sentence "4 wo kakeru," in which the targeted word "3" is replaced 
with "4," and registered in the corresponding translation database among the natural 
sentences extracted by the first retrieval (retrieval of the natural sentences containing the 
phrase to be translated "^^ It -5") is displayed with the translation ("multiply") of the phrase 
to be translated "A^Ct^" highlighted and with the translation ("four") of the alternative word 
"4" marked by assuming the.words such as "3" to be the targeted word and determining the 
word "4" as the alternative word having high probability of being replaceable with the 
targeted word "3." 
"multiply by *four*" 

Therefore, also in this example, a suitable natiu^al sentence (translated sentence) that uses the 
phrase to be translated "A^ (t -5" in the same meaning of the original sentence to be 
processed is selected. 

[0090]C)n the other hand, when the conforming natural sentence is not extracted in the 
retrieval of the natural sentences in which only the targeted word differs (when the answer in 
Step 126 is negative), or when the natural sentence in which the targeted word is replaced 
with the alternative word does not exist in a plurality of natural sentences having the 
maximum degree of coincidence (when the answer in Step 130 is negative), it is difficult to 
identify a suitable translated sentence. Therefore the program moves to Step 1 34 to read each 
of all the natural sentences extracted by the retrieval in Step 102 or the translated sentences 
correlated with a plurality of natural sentences in the descending order of the degree of 
coincidence (a certain number of natural sentences in the descending order of the degree of 
coincidence, or all the natural sentences having the degree of coincidence more than the 
specified value) and registered in the corresponding translation database, to search each of the 
read translated sentence for the translation of the phrase to be translated, to list the individual 
translated sentences on display 12 in order depending on the degree of coincidence and the 
number of mismatched words in descending order of the degree of coincidence of each 
natural sentence having the maximum degree of coincidence with the translation of the found 
phrase to be translated highlighted and to end the processing. In this case, a user can 
recognize some candidates for suitable translations of the specified phrase to be translated. 
In this case, a user can recognize some candidates for suitable translations of the specified 
phrase to be translated. 

[0091] In the abovementioned examples, although the mode for calculating the degree of 
coincidence using only the number of coinciding words is described, modes are not limited to 
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this. It is acceptable to define the formula for calculating the degree of coincidence so that the 
degree of coincidence increases as the number of coinciding words between natural sentences 
and original sentences increases in order to calculate and evaluate the degree of coincidence 
based on the nxunber of coinciding words and the number of mismatched words. 
It is also acceptable to first evaluate the degree of coincidence only based on the nimiber of 
coinciding words, to count the nxmiber of mismatched words when it is difficult to select a 
single natural sentence (translated sentence) even by performing a correlation analysis or a 
scheme analysis, and to select a natural sentence (translated sentence) having the minimum 
number of words mismatching the original sentence. 

[0092] In evaluating the degree of coincidence, it is also acceptable to commonly use a 
degree of similarity of arranged order of a natural sentence and an original sentence, or the 
nimiber of mismatched words in natural sentences existing in the words coinciding with the 
original sentence, in addition to the abovementioned number of coinciding words and number 
of mismatched words to evaluate the degree of coincidence so that the degree of coincidence 
with the original sentence increases as the similarity of arranged order of words between a 
natural sentence and an original sentence increases or to evaluate the degree of coincidence 
so that the degree of coincidence with the original sentence increases as the number of 
mismatched words existing between the coinciding words decreases. 
The evaluation of the degree of coincidence considering the similarity of arranged order of 
words between a natural sentence and an original sentence and the number of mismatched 
words existing between words coinciding with the original sentence can be specifically 
realized by, for example, the following processing: 

[0093] That is to say, each word in the original sentence is provided with the first assessment 
value according to a difference (the number of words) from the phrase to be translated in the 
original sentence. 

This first assessment value can be defined so that the first assessment value for the phrase to 
be translated in the original sentence is the largest, and the first assessment value for each 
word other than the phrase to be translated in the original sentence decreases as the difference 
from the phrase to be translated in the original sentence increases (as the number of words 
existing in-between increases). 

For example, when an original sentence is composed of nine words A through J (original 
sentence = (A, B, C, D, E, F, Q H, J)) and the fourth word D in the original sentence is 
specified as a phrase to be translated, each word A through J in the original sentence can be 
provided with the following first assessment value: 
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A= 1.0, B = 2.0, C = 5.0, D = 10.0, E = 5.0, F = 2.0, G = 1.0, H = 0.5 and J = 0.2 
[0094] Then, each natural sentence containing the phrase to be translated extracted from the 
corresponding translation database by the retrieval in Step 102 is searched for a coinciding 
word coinciding with any word in the original sentence among each word in the natural 
sentence to calculate the sum of the first assessment value already provided to each of the 
found coinciding words. 

This sum of the first assessment value is assiuned to be the degree of coincidence, and the 
translated sentences of each natural sentence extracted by the retrieval are displayed in the 
descending order of the degree of coincidence of each correlated natural sentence (the sum of 
the first assessment value). 

For example, when natural sentences 1 through 5 having the following word arrangement are 

extracted by retrieval (where word X is an arbitrary word): 

Natural sentence 1 = (D, X, X, E, F, G) 

Natural sentence 2 = (D, X, E, F, G) 

Natural sentence 3 = (D, E, F, G) 

Natural sentence 4 = (A, B, X, C, X, D) 

Natural sentence 5 = (D, E, F, Q H, J) 

The sum of the first assessment value of each natural sentence is: 

Natural sentence 1 : 10.0 (word D) + 0 (word X) + 0 (word X) + 5.0 (word E) + 2.0 (word F) 
+ 1.0 (word G)= 18.0 

Natural sentence 2: 10.0 (word D) + 0 (word X) + 5.0 (word E) + 2.0 (word F) + LO (word 

G) =18.0 

Natural sentence 3: 10.0 (word D) + 5.0 (word E) + 2.0 (word F) + 1.0 (word G) = 18.0 
Natural sentence 4: 1 .0 (word A) + 2.0 (word B) + 0 (word X) + 5.0 (word C) + 0 (word X) + 
10.0 (word D) = 18.0 

Natural sentence 5: 10.0 (word D) + 5.0 (word E) + 2.0 (word F) + 1.0 (word G) + 0.5 (word 

H) + 0.2 (word J) -18.7, 

where natural sentence 5 of the sum (the degree of coincidence) of the first assessment value 
is the largest. 

[0095] The first assessment value is defined so that the values increases as the difference 
between the phrase to be translated in the original sentence and each word other than the 
phrase to be translated in the original sentence decreases. Therefore by evaluating each 
natural sentence containing the phrase to be translated based oh the sum of the first 
assessment value as mentioned above, the natural sentence containing many words existing in 
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the position close to the phrase to be translated in the original sentence, in other words, the 
natural sentence having high probability of containing many phrases composed of the phrase 
to be translated in the original sentence and the words located close to the phrase to be 
translated (the natural sentence of which similarity of arranged order of words with the 
original sentence is estimated to be high) can be implemented as the natural sentence having 
higher degree of coincidence. This mode corresponds to the invention of the fifth, twelfth, 
and sixteenth embodiments. 

[0096] Furthermore, the coincidence, i.e., total value of the first evaluated value, in the 
natural sentences 1-4 is equivalent in the above case, and when there are multiple natural 
sentences with equivalent coincidence based on the first evaluated value as in this case, the 
second evaluated value is granted according to the interval, i.e., the nimiber of words, fi-om 
the target phrase for rendition in the respective natural sentences toward the word 
corresponding to either of the words in the original text among the respective words in the 
natural sentence regarding the respective natural sentences with equivalent coincidence based 
on the first evaluated value. 

This second assessment value can also be defined so that the second assessment value for the 
phrase to be translated contained in each natural sentence is the largest and the value 
decreases as the diflFerence from the phrase to be translated in each natural sentence increases 
(as the number of words existing in-between increases). 

Thus the svun of the second assessment value provided to each coinciding word in each 
natural sentence having the same value of the degree of coincidence based on the first 
assessment value is calculated, and the translation of each natural sentence having the same 
value of the degree of coincidence based on the first assessment value is displayed in the 
descending order of the sum of the second assessment value of each correlated natural 
sentence (this is also included in the degree of coincidence relating to this invention). 
[0097] For example, each coinciding word in the aforementioned natural sentences 1 through 
4 is provided with the second assessment value to obtain the sum of the second assessment 
value as shown below. 

In the example below, the second assessment value of the phrase to be translated is assumed 
to be 10.0, and the second assessment value is defined so that the second assessment value of 
the other coinciding words is assumed to decrease like 5.0, 2.0, 1 .0, 0.5, 0.2, ... as the 
mmiber of words existing between the phrases to be translated increases like 0, 1, 2, 3, 4, . . .. 
Natural sentence 1 : 10.0 (word D) + 0 (word X) + 0 (word X) + 1 .0 (word E) + 0.5 (word F) 
+ 0.2 (word G) = 11.7 
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Natural sentence 2: 10.0 (word D) + 0 (word X) + 2.0 (word E) + 1 .0 (word F) + 0.5 (word 
G)=13.5 

Natural sentence 3: 10.0 (word D) + 5.0 (word E) + 2.0 (word F) + 1.0 (word G) = 18.0 
Natural sentence 4: 0.2 (word A) + 0.5 (word B) + 0 (word X) + 2.0 (word C) + 0 (word X) + 
10.0 (word D)= 12.7 

Therefore, natural sentences 1 through 4 (translated sentences) are displayed in the 
descending order of the sum of the second assessment value, namely in the order of natural 
sentences 3, 2, 4, 1. 

[0098] The second assessment value is defined so that the value increases as the difference 
between the coinciding word coinciding with any word in the original sentence among the 
words in each natural sentence and the phrase to be translated in each natural sentence 
decreases. Therefore the natural sentence containing a larger number of the words coinciding 
with the original sentence and containing the mismatched words existing between the 
coinciding words as small as possible, namely the natural sentence having a high probability 
that the natural sentence contains the phrase to be translated in the original sentence and the 
phrases composed of the words close to the phrase to be translated, can be implemented as 
the natural sentence having higher degree of coincidence by evaluating each natural sentence 
containing the phrase to be translated based on the sum of the second assessment values. 
Incidentally, the above aspect is also compatible with the invention of the fifth (see the sixth 
embodiment for details), twelfth, and seventeenth embodiments. 

[0099] The evaluation of the similarity of arranging order of words in natural sentences and 
original sentences and the degree of coincidence considering the nimiber of mismatched 
words in the natural sentences existing between the original sentence and the coinciding 
words can alsobe realized by, for example, the following processing: 

[0100] An upstream word having the shortest distance from the phrase to be translated (this 
time, an upstream word adjacent to the phrase to be translated) is extracted from an upstream 
word group existing in the upstream side of the phrase to be translated in the original 
sentence, and a downstream word having the shortest distance from the phrase to be 
translated (this time, a downstream word adjacent to the phrase to be translated) is extracted 
from a downstream word group existing in the downstream side of the phrase to be translated 
in the original sentence. 

Then the position where the phrase to be translated exists is assumed to be the reference 
position in each natural sentence containing the phrase to be translated extracted from the 
corresponding translation database by the retrieval (when the phrase to be translated exists in 
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a plurality of places in the natural sentence, the position where any phrase to be translated 
exists is assumed to be the reference position) to search the upstream region within the 
number of specified words (for example within three words) from the reference position to 
the upstream side for the upstream word akeady extracted and to search the downstream 
region within the nxraiber of specified words (for example within three words) from the 
reference position to the downstream side for the downstream word already extracted. 
The natural sentence in which the upstream and downstream words are foimd by the 
abovementioned searching is provided with a third assessment value defined so that the value 
increases as the distances between the reference position and the upstream word and between 
the reference position and the downstream word decrease. 

[0101] Then the upstream word not yet extracted and having the shortest distance from the 
upstream word group (this time, the upstream word between which and the phrase to be 
translated a single word (the upstream word used in the previous retrieval) exists) in the 
original sentence is extracted from the upstream word group, and the downstream word not 
yet extracted and having the shortest distance from the downstream word group (this time, 
the downstream word between which and the phrase to be translated a single word (the 
downstream word used in the previous retrieval) exists) in the original sentence is extracted 
from the downstream word group. 

The region within the specified number of words (for example within three words) from the 
position in which the upstream word was found in the previous retrieval to the upstream side 
in each natural sentence in which the upstream and downstream words were found by the 
previous retrieval is searched for the upstream word previously extracted, and the region 
within the specified number of words from the position in which the downstream word was 
found in the previous search of the downstream position in each natural sentence in which the 
upstream and downstream words were foxmd in the previous search is searched for the 
downstream word previously extracted. 

The natural sentence in which at least one of the upstream and downstream words is foimd by 
the search this time is provided with the third assessment value that is defined so that the 
value increases as the distance between the position of the upstream or downstream word 
found by the previous search and the position of the upstream or downstream word found by 
the search this time decreases. 

[0102] It is acceptable to use an assessment value that is equally defined only according to 
the distance between the position of the upstream or downstream word found by the previous 
search and the position of the upstream or downstream word found by the search this time as 
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the abovementioned third assessment value as mentioned above. However, considering the 
distance between the phrase to be translated and the upstream or downstream word to be 
searched in the original sentence, it is acceptable to define the third assessment value so that 
the influence of the upstream or downstream word to be searched on the third assessment 
value becomes greater as the distance between the upstream or downstream word to be 
searched and the phrase to be translated in the original sentence decreases (so that the third 
assessment value to be provided decreases totally as the distance between the upstream or 
downstream word to be searched for and the phrase to be translated in the original sentence 
increases). 

Also as described above, in place of setting dovra the third evaluated value in such a way as 
to increase the value along with the lessening interval in the natural sentence between the 
position of the anterior or posterior words detected in the preceding search and the position of 
the anterior or posterior words detected in the current search, the third evaluated value may 
be laid down so as to increase the value along with the lessening interval in the natural 
sentence between the position of the aforementioned reference position and the position of 
the anterior or posterior words detected in the current search. 

[0103] The upstream and downstream words are extracted from the original sentence in 
ascending order of the distance from the phrase to be translated to repeat the above 
processing imtil the time any upstream or downstream word extractable from the original 
sentence remains no more. Then the sum of the third assessment values provided to each 
natural sentence is calculated and assiuned to be the degree of coincidence, and the 
translation of each natural sentence, extracted by the retrieval is displayed in the descending 
order of the degree of coincidence (the siun of the third assessment values) of each 
corresponding natural sentence. 

[0104] For example, when an original sentence is composed of five words A through E 
(original sentence = (A, B, C, D, E)) and the third word C in the original sentence is specified 
as a phrase to be translated, first words B and D are extracted from the original sentence as an 
upstream word and a downstream word respectively to search the region within the specified 
number of words (for example within three words) from the phrase to be translated to the 
upstream side for the upstream word and to search the region within the specified munber of 
words (for example within three words) from the phrase to be translated to the downstream 
side for the downstream word in each natural sentence containing the phrase to be translated 
extracted from the corresponding translation database by the retrieval to provide the natural 
sentence in which the upstream and downstream words are found with the third assessment 
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Then words A and E are extracted from the original sentence as an upstream word and a 
downstream word respectively to search the region within the specified number of words (for 
example within three words) from the upstream word to the upstream side for the upstream 
word A and to search the region within the specified number of words (for example within 
three words) from the downstream word D to the downstream side for the downstream word 
E in the natural sentence in which the upstream word B and the downstream word D were 
found to provide the natural sentence in which the upstream word A and the downstream 
word E were found with the third assessment value. 

[0105] The abovementioned processing is applied to ten natural sentences (a) through (k) 
shown below and the sentences are arranged in the descending order of the sum of the third 
assessment value as shown below, where word X and mean an arbitrary word and a 
punctuation mark respectively. 

1. Natural sentence (a) = (A, B, C, D, E) 

2. Natural sentence (b) = (A, B, C, D, /, B, C, D, E) 

3. Natural sentence (c) = (B, C, D) 

4. Natural sentence (d) = (A, B, C, X, D, E) 

5. Natural sentence (e) = (A, B, C, X, D) 

6. Natural sentence (f) = (A, B, C, /, C, D, E) 

7. Natural sentence (g) = (A, X, B, C, X, D) 

8. Natural sentence (h) = (B, C, /, C, D) 

9. Natural sentence (j) = (B, X, C, /, C, X, D) 

10. Natural sentence (k) = (B, X, X, C, /, C, X, X, D) 

By providing each natural sentence containing the phrase to be translated with the third 
assessment value as shown above and evaluating based on the sum of the third assessment 
value, the natural sentence having high similarity of arranging order of word with the original 
sentence and having the number of mismatched words as small as possible existing between 
the words coinciding with the original sentence, that is to say, the natural sentence containing 
the original sentence in the condition that the natural sentence contains the mismatched words 
as small as possible (in the condition of high integrity) can be implemented as the natiu-al 
sentence having a higher degree of coincidence. This mode corresponds to the invention of 
the seventh, thirteenth, and eighteenth embodiments. 

[0106] The evaluation considering the similarity of arranged order of words between the 
natural sentence and the original sentence and the number of mismatched words in the natural 
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sentence existing between the words coinciding with the original sentence can be realized by, 
for example, the following processing: 

[0107] First, the upstream word having the shortest distance from the phrase to be translated 
(this time the upstream word adjacent to the phrase to be translated) is extracted from the 
upstream word group existing in the upstream side of the phrase to be translated in the 
original sentence, then the position in which the phrase to be translated exists in the natural 
sentence is assxmied to be the reference position in each natural sentence containing the 
phrase to be translated extracted from the corresponding translation database by the retrieval 
(when the phrase to be translated exists in each of a plurality of places in the natural sentence, 
the position in which any one of the phrase to be translated exists is asstmied to be the 
reference position), and the region within the specified number of words (for example three 
words) from the reference position to the upstream side is searched for the upstream word 
previously extracted. 

This processing coxmts the number of words by distance (more specifically the first number 
of words by distance and the second number of words by distance) as a fourth assessment 
value from each of the natural sentences containing the phrase to be translated, and measures 
the distance between the reference position and the upstream word in the natural sentence in 
which the upstream word was foimd in the abovementioned retrieval to determine the first 
number of words by distance according to each measured distance. 

[0108] The upstream word not yet extracted and having the shortest distance from the phrase 
to be translated (the upstream word between which and the phrase to be translated a single 
word (the upstream word used for the previous retrieval) exists) is extracted from the 
upstream word group, the region within the specified number of words (for example, within 
three words) from the position in which the upstream word previously extracted was found in 
the previous searching to the upstream side in each natural sentence in which the upstream 
word was found in the previous search is searched for the upstream word previously extracted, 
the distance between the position of the upstream word found in the previous search and the 
upstream word found this time is measured, and the distance between the upstream word 
found in the previous search and the reference position is added to the measured, distance to 
obtain the distance between the upstream word found in the search this time and the reference 
position and to determine the first number of words by distance according to each of the 
obtained distances. 

The upstream words are extracted from the original sentence in the ascending order of the 
distance from the phrase to be translated to repeat the abovementioned processing until the 
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time any upstream word extractable from the original sentence remains no more. 
[0109] For example, when an original sentence is composed of five words A through E 
(original sentence = (A, B, C, D, E)) and the fourth word D in the original sentence is 
specified as the phrase to be translated, first, the word C is extracted from the original 
sentence as an upstream word, and the region within the specified number of words (for 
example, three words) from the phrase to be translated to the upstream side in each natural 
sentence containing the phrase to be translated extracted from the corresponding translation 
database by the retrieval is searched for the upstream word to determine the first number of 
words by distance in the natural sentence in which word C was foimd. 
An example of a result of counting the first number of words by distance at this point is 
shown in Table 1 below: 
[0110] Table 1. 

[0111] Then word B is extracted from the original sentence as the upstream word, and the 
region within the specified number of words (for example, three words) from the upstream 
word C to the upstream side in the natural sentence in which the upstream word C was found 
in the previous searching is searched for the upstream word B to determine the first number 
of words by distance in the natural sentence in which the upstream word B was found. 
An example of a result of coxmting the first number of words by distance at this point is 
shown in Table 2 below: 
[0112] Table 2. 

[01 13] Further, word A is extracted from the original sentence as the upstream word, and the 
region within the specified number of words (for example, three words) from the upstream 
word B to the upstream side in the natural sentence in which the upstream word B was foimd 
in the previous searching is searched for the upstream word A to determine the first number 
of words by distance in the natural sentence in which the upstream word A was found. 
An example of a result of counting the first number of words by distance at this point is 
shown in Table 3 below: 
[3fetT] 

[0114] Table 3. 

[0115] The distance between words is measured in each natural sentence containing the 
phrase to be translated extracted from the corresponding translation database by the retrieval 
in a method different from the abovementioned first number of words by distance to simi up 
as the second number of words by distance. 
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More specifically, in the condition that the existence of the upstream word adjacent to the 
phrase to be translated among the upstream word group in the original sentence is ignored 
(word C contained in each natural sentence is regarded as an arbitrary mismatch of word X), 
the upstream word having the shortest distance from the phrase to be translated is extracted 
from the upstream word group in the original sentence, and the position in which the phrase 
to be translated exists in each natural sentence is assiuned to be the reference position in each 
natural sentence containing the phrase to be translated extracted from the corresponding 
translation database by the retrieval to search the region within the specified number of words 
(for example, three words) from the reference position to the upstream side for the extracted 
upstream word. 

The distance between the reference position and the upstream word (more specifically, the 
number of mismatched words existing between the reference position and the upstream word) 
is measured in the natural sentence in which the upstream word was found in the previous 
searching, and the distance obtained by adding "1" to the measured distance is assumed to be 
the distance of the reference position to determine the second number of words by distance 
according to each distance from the reference position. 

[0116] The upstream word not yet extracted and having the shortest distance from the phrase 
to be translated is extracted from the upstream word group, the region within the specified 
number of words (for example, three words) from the position of the upstream word found in 
the previous search is searched for the previously extracted upstream word in each natural 
sentence in which the upstream word was found in the previous search, the distance between 
the position of the upstream word found in the previous search and the upstream word foimd 
this time is measured in the natural sentence in which the upstream word is found in the 
search this time, and the distance between the upstream word found in the previous search 
and the reference position is added to the measured distance to obtain the distance between 
the upstream word found in the search this time and the reference position and to determine 
the second number of words by distance according to the obtained distance. 
The upstream words are extracted from the original sentence in the ascending order of the 
distance from the phrase to be translated to repeat the abovementioned processing imtil the 
time any upstream word extractable from the original sentence remains no more. 
Wt] 

[0117] For example, when an original sentence is composed of five words A through E 
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(original sentence = (A, B, C, D, E)) and the fourth word D in the original sentence is 
specified as the phrase to be translated, first in the condition that existence of the upstream 
word C adjacent to the phrase to be translated among the upstream word group in the original 
sentence is ignored, word B and word A are extracted from the original sentence in sequence 
to measure the distance from the reference position in each natural sentence to obtain the 
result of measurement as shown in Table 4 below: 
[0118] [Table 4] 

[0119] The second number of words by distance is determined based on the abovementioned 
measurement result to obtain the counting result of the second number of words by distance 
as shown in Table 5. 
[0120] [Tables] 

[0121] As mentioned above, when the counting of the first number of words by distance and 
the coimting of the second number of words by distance concerning the upstream word group 
in the original sentence are finished in each natural sentence containing the phrase to be 
translated, the counting results of the first number of words by distance and the second 
number of words by distance are compared in each natural sentence, and the coimting result 
showing the result that the distance between words is smaller is selected from the first 
number of words by distance and the second number of words by distance as a final 
evaluation. 

More specifically, for example, a natural sentence (A, B, C, D) has a distance of 0 and the 
nimiber of words 3 in the first number of words by distance, while the natural sentence has 
the number of words 0 at a distance of 0, the nimiber of words 0 at a distance of 1 and the 
number of words 2 at a distance of 2 in the second number of words by distance as shown in 
Table 5. Therefore, the first number of words by distance having larger number of words at a 
distance of 0 is selected as the final evaluation for the upstream word group of the natural 
sentence (A, B, C, D). 

The abovementioned example is for the case that the niunbers of words at a distance of 0 of 
the first number of words by distance and the second number of words by distance are 
different. When the numbers of words at a distance 0 are identical to each other, the numbers 
of words at distances 1,2, ... are compared in sequence and the number of words having 
larger number at the same distance is selected from the first number of words by distance and 
the second number of words by distance as the final evaluation. 

[0122] The purpose of counting the second number of words by distance is to correctly 
evaluate the similarity of the arranged order of words with the original sentence also for the 
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natural sentence in which the order of a part of words is changed from the original sentence. 
For example, in a natural sentence (C, A, B, D), the niraiber of words is 0 at a distance of 0, 
the number of words 0 at a distance of 1 and the nimiber of words 1 at a distance 2 in the first 
number of words by distance, while the nimiber of words is 0 at a distance of 0, the nmnber 
of words 2 at a distance of 1 , and the number of words 0 at a distance of 2 in the second 
number of words by distance. Thus, the second number of words by distance is selected as 
the final evaluation for the upstream word group in the abovementioned natural sentence. 
In the abovementioned example, the counting result for the first nimiber of words by distance 
is evaluated by regarding the natural sentence (C, A, B, D) as (C, X, X, D), while the 
counting result for the second number of words by distance is evaluated by regarding the 
natural sentence (C, A, B, D) as (X, A, B, (word C removed), D). Therefore, the 
abovementioned natural sentence can be evaluated that the original sentence is contained in 
the condition of higher integrity. 

[0123] When the final evaluation is determined by performing the abovementioned 
processing for each natural sentence containing the phrase to be translated extracted from the 
corresponding translation database by the retrieval, the similar processing (counting of the 
first nimiber of words by distance and the second number of words by distance, and 
determination of the final evaluation conceming the downstream word group) is performed 
using the downstream word group existing in the downstream side of the phrase to be 
translated in the original sentence for each of the abovementioned natural sentences. 
Then the obtained final evaluation for the upstream word group and the final evaluation for 
the downstream word group are summed (the number of words by distance in each final 
evaluation is summed by the same distance) for each natural sentence to obtain a 
comprehensive evaluation for each natural sentence. 

The translated sentence of each natural sentence is displayed in ascending order of the 
distance between words shown by the comprehensive evaluation for each of the 
corresponding natural sentences based on the comprehensive evaluation for each natural 
sentence. 

[0124] More specifically, for example, by repeating rearrangement of each natural sentence in 
the descending order of the number of words at a distance of 0 and the subsequent 
rearrangement of the natural sentence having the same number of words at a distance of 0 in 
descending order of the number of words at a distance of 1 while increasing the distance 
corresponding to the number of words to be compared at an increment of 1, each sentence is 
rearranged in ascending order of the distance between words shown by the comprehensive 
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evaluation, and the translated sentences corresponding to each natural sentence are displayed. 

And, the translated sentences corresponding to each natural sentence are displayed. 

[0125] According to this, nine natural sentences (a) through (j) shown below are processed as 

mentioned above to rearrange the natural sentences based on the comprehensive evaluation, 

resulting in the following order: 

[SSlfT] 

L Natural sentence (a) = (A, B, C, D, E) 

2. Natural sentence (b) = (A, B, C, D, /, B, C, D, E) 

3. Natural sentence (c) = (A, B, C, X, D) 

4. Natural sentence (d) = (B, C, D) 

5. Natural sentence (e) = (A, X, B, C, X, D) 

6. Natural sentence (f) = (B, C, X, D) 

7. Natural sentence (g) = (B, C, /, C, D) 

8. Natural sentence (h) = (B, X, C, /, C, X, D) 

9. Natural sentence (j) = (B, X, X, C, /, C, X, X, D) 

In the abovementioned processing, the counting of the second number of words by distance is 
not essential. When it is not necessary to evaluate the natural sentence for which words are 
partially rearranged from the original sentence, it is acceptable to omit the coimting of the 
second number of words by distance and use the first number of words by distance as the 
final evaluation. Incidentally, the above aspect corresponds to the inventions of the eighth, 
fourteenth, and nineteenth embodiments, and especially, the aspect using the number of 
words by the first distance in combination with the second acconmiodates the invention of the 
ninth embodiment. 

[0126] It is a matter of course that the evaluation of degree of coincidence and selection or 
rearrangement of natural sentences based on the abovementioned first assessment value and 
the second assessment value/third assessment value/fourth assessment value (the first number 
of words by distance and the second number of words by distance) can be commonly used 
with evaluation, selection, or rearrangement of natural sentences by a correlation analysis or a 
scheme analysis. The mode for evaluating the degree of coincidence of natural sentences 
based on the first assessment value and the second assessment value or the third assessment 
value is more suitable for the case when retrieving the translated sentence to be used for 
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reference as for the phrases within the range narrower than the entire original sentence, such . 
as phrases existing in the original sentences in comparison with the modes, etc., for 
evaluating the degree of coincidence using only the number of coinciding words between the 
original sentence and the natural sentence. For example, it is acceptable to provide a retrieval 
mode, such as a phrase retrieving mode, in addition to the general retrieval mode, and to 
perform the evaluation of the degree of coincidence based on the first assessment value and 
the second assessment value or the third assessment value and the selection or rearrangement 
of the natural sentences when this phrase retrieval mode is selected. 

[0127] Although an example wherein the degree of coincidence is first calculated to select a 
natural sentence (identify the corresponding translation), a natural sentence is selected 
(corresponding translation is identified) by a correlation analysis when the suitable natural 
sentence cannot be narrowed only based on the degree of coincidence, and a natural sentence 
is selected (corresponding translation is identified) by a scheme analysis when the suitable 
natural sentence cannot be narrowed dovra even based on the correlation analysis was 
described above, the application is not limited to this. It is acceptable to calculate the degree 
of coincidence for individual natural sentences containing the phrase to be translated 
extracted by the retrieval of the retrieval means, to evaluate the individual natural sentences 
by performing the correlation analysis and the scheme analysis, and to select a natural 
sentences according to the priority order, for example, shown in Table 6 below based on the 
evaluation results. 
[0128] Table 6. 
[3fetT] 

[0129] Furthermore, it is acceptable to calculate the degree of coincidence for individual 
natural sentences, to perform each of the correlation analysis and the scheme analysis, and to 
select the natural sentence having the largest comprehensive evaluation score. 
Comprehensive evaluation score = a x (degree of coincidence) + b x (correlation evaluation 
score) + c X (scheme evaluation score) 

where the correlation evaluation score is an evaluation score provided to the individual 
natural sentences based on the result of the correlation analysis, the scheme evaluation score 
is an evaluation score provided to the individual natural sentences based on the result of the 
scheme analysis, and a, b and c are predefined factors. 

[0130] The program was formulated in the translation identifying processing (Figure 2) 
previously described so that whether the natural sentences completely coinciding with the 
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original sentence to be processed is extracted from the corresponding translation database by 
the retrieval in Step 102 or not is asked (Step 104), and if the answer is positive, the 
translated sentence of the natural sentence completely coinciding with the original sentence to 
be processed is read and displayed. However, the application is not limited to this. It is 
acceptable to omit the above-mentioned Steps 104 and 106 and to unconditionally perform 
the processing in Step 108 and the subsequent steps (processing such as calculating the 
degree of coincidence) without relating to whether any natural sentence completely 
coinciding with the original sentence to be processed is extracted by the retrieval in Step 102 
or not. 

In this mode, also when a plurality of natural sentences is extracted by the retrieval in Step 
102, the degree of coincidence of the natural sentence is maximized if the natural sentence 
completely coinciding with the original sentence is contained in a plurality of the extracted 
natural sentences. Therefore, the same processing result as when Steps 104 and 106 are 
performed can be obtained. 

[0131] Accuracy of selecting a translation in this invention depends on the numbers of natural 
sentences and the translated sentences registered in the corresponding translation database, 
and the accuracy of selecting a translation improves as the numbers of natural sentences and 
the translated sentences increase. 

Therefore, it is acceptable to read original sentences to be translated and translated sentences 
in the translation-identifying system relating to this invention, and to automatically and 
additionally register the read natural sentences and translated sentences as they were or after 
performing selection iii the corresponding translation database. 

According to this invention, it is necessary only to additionally register the natural sentences 
and the translated sentences in the corresponding translation database, and it is not necessary 
to perform editing or processing, such as division of natural sentences by word unit. 
Therefore, it is very easy to automatically and additionally register information (the natural 
sentences and the translated sentences) in the corresponding translation database. 
[0132] Although an example of the mode for highlighting the translation of the phrase to be 
translated when displaying a translated sentence was described above, the mode is not limited 
to this. It is a matter of course that the mode not highlighting the translation of the phrase to 
be translated is also included in the scope of the right of this invention. 
[0133] The accuracy of selecting a translation in this invention also depends on redundancy 
of the contents of the natural sentences and the translated sentences registered in the 
corresponding translation database. AVhen the redundancy of the contents of the natural 
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sentences and the translated sentences registered in the corresponding translation database 
heightens, the accuracy of selecting a translation lowers in proportion to the numbers of the 
natural sentences and the translated sentences registered in the corresponding translation 
database (capacity of the corresponding translation database). 

Therefore, when the individual natural sentences and translated sentences registered in the 
corresponding translation database is searched for natural sentences and the translated 
sentences having a high similarity of contents, and the natural sentences and translated 
sentences having a high similarity of contents are foimd, it is acceptable to provide the 
function for removing a set of the natural sentences and translated sentences from the 
corresponding translation database. 

[0134] Although an example of selecting a translation using a single corresponding 
translation database was described above, the application is not limited to this. It is acceptable 
to divide the corresponding translation database by field of natural sentences when the 
original sentences to be processed are limited to a specific field such that the program is used 
for translating a manual of a specific field. 

Thus, the increase of capacity for the individual corresponding translation database can be 
avoided and the effects, such as saving time for retrieving natural sentences and improvement 
of the accuracy of selecting a translation, are obtained. 

[0135] Although an example of a mode in which the translation identifying processing is run 
by PC 10 according to the specified operation for commanding to display the translation of 
the phrase to be translated performed by a user through PC 10 was described above, the 
application is not Umited to this. It is possible to configure by applying the invention to a 
computer system 38 where both a client PC 32 and a server 34 are connected to a network 30, 
such as the Internet or LAN (Local Area Network), and a storage medium 36 composed of a 
HDD, etc., for storing the corresponding translation database is connected to server 34 so that, 
when a user asks a translation on line through client PC 32, server 34 identifies and provides 
the suitable translation on-line as shown in Figure 4. 

[0136] In computer system 38 shown in Figure 4, a text of an original sentence is specified in 
order to enable a user to make an inquiry on a translation (see (1) in Figure 4) by, for example, 
the operation that the user transmits text data of the source language to server 34 through 
client PC 32 or the user communicates the URL (uniform resource locator) of the Web page 
containing the text of the source language to server 34. 

In this case, it is possible to configure the system so that server 34 executes the translation 
identifying processing while accessing the corresponding translation database stored in 
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storage medium 36 to identify the translation (sentence) of the specified text (see (2) in. 
Figure 4) and transmit the identified translation (sentence) to client PC 32 to answer to the 
inquiry (see (3) in Figure 4). 

Since this configuration allows a plurality of users to share the translation identifying 
function (and the corresponding translation database) of server 34, it is effective for reducing 
cost especially when the capacity required for the corresponding translation database is large. 
[0137] Although a mode for applying this invention to an electronic dictionary system was 
described above as an example, the application is not limited to this. It is a matter of course 
that this invention is also applicable to a machine translation (automatic translation) system, 
an automatic interpretation system (a system for performing interpretation by recognizing an 
acoustically entered original sentence by acoustic recognition, identifying the translated 
sentence of the recognized original sentence, and transmitting the identified translated 
sentence acoustically), a language learning system, etc. 

Figure 5 is a schematic diagram showing a configuration of a machine translation system 40 
to which this invention is applied. 

In Figure 5, an original sentence input section 42 inputs (text data of) an original sentence to 
be translated to both a section for selecting a phrase to be translated 44 and a translated 
sentence assembly section 46. 

Section for selecting a phrase to be translated 44 repetitively selects a specific phrase in the 
entered original sentence to be translated as a phrase to be translated and asks a translation 
identifying section 48 to retrieve a translation of the selected phrase to be translated while 
selecting an individual phrase in the original sentence to be translated as the phrase to be 
translated in sequence. 

Translation identifying section 48 is a section corresponding to the translation-identifying 
system relating to this invention and executes translation-identifying processing while 
accessing a corresponding translation database stored in a storage medium 50 to identify the 
translation of the inquired phrase to be translated and to transmit the identified translation to 
translated sentence assembly section 46. 

[0138] Translated sentence assembly section 46 links translations (changes arranging order if 
necessary) transmitted fi-om translation identifying section 48 to assemble the translated 
sentence in a target language based on the phrases to be translated transmitted from section 
for selecting a phrase to be translated 44, their translations transmitted from translation 
identifying section 48 and the original sentence of the phrase to be translated transmitted 
from original sentence input section 42. 
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The translated sentence assembled in translated sentence assembly section 46 is transmitted 

to translated sentence output section 52 to perform translated sentence output processing, 

such as displaying on a display, recording to a storage medium, acoustic output, etc. 

In the above-mentioned configuration, since this invention is applied to identification of the 

translation of the plu'ase to be translated, the translation having high probability of obtaining 

a natural translated sentence as a sentence in the target language is obtained fi-om the original 

sentence, and the translated sentence transmitted firom translated sentence output section 52 

becomes a natural translated sentence as a sentence in the target language. 

Moreover, since no complicated processing such as identifying a part of speech or analyzing 

a sentence structure is required, the process can be simplified. 

It is a matter of course that, also in Figure 5, it is acceptable to transmit on-line the translated 
sentence obtained through the processing such as identifying and assembling the translations 
to answer the inquiry of translated sentence made on-line as shown in Figure 4 already 
explained. 

[0139] Although one of a pair of correlated sentences (a pair of sentences described in 
different languages) registered in the corresponding translation database (storage means) and 
the other of the pair were called a natural sentence and a translated sentence respectively, it is 
possible to use corresponding translation database created for translation (identification of 
translated sentence) fi-om a first language to a second language as the database for translation 
(identification of translated sentence) from the second language to the first language as 
described above. 

[0140] Although the cases of English, Japanese, and Chinese are described above as an 

example of a source language or a target language, the languages are not limited to these. It is 

a matter of course that, for example, French, German, Italian, Spanish, Portuguese, etc. are 

applicable as the original or target language. 

Description of the Reference Numerals 
[0141] 10 PC 

18 HDD 

34 Server 

36 Storage medium 
40 Machine translation system 
48 Translation identifying section 
50 Storage medium 



